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Essential  Pattern  and  Sequence  Sensitivity 
in  Semiconductor  Memories 

A.  Tuszynski 
Abstract 

Pragmatic  classification  of  errors  linked  to  a  study  of 
Essential  Pattern  and  Sequence  Sensitivity  leads  to  a  new  and 
credible  strategy  for  the  validation  of  individual  chips  and 
entire  memories.  Three  categories  of  errors  are  recognized: 
hard,  non- random  soft  and  random  soft  errors.  Testing  require¬ 
ments  for  hard  errors  vary  linearly  with  the  number  of  storage 
bits.  Exhaustive  testing  for  NON-RANDOM  soft  errors  is 
infeasible,  because  test  requirements  grow  exponentially  with  the 
number  of  bits.  And  finally,  individual  testing  of  reasonably 
good  chips  for  RANDOM  SOFT  ERRORS  does  not  make  any  sense  at  all . 
That  is  why  one  must  speak  of  validation  rather  than  testing. 

Testing  as  such  is  addressed  to  hard  errors  only.  Non- random 
soft  errors  are  tracked-down  by  adverse  analysis  and  diagnostics, 
to  be  eliminated  in  due  course  at  the  design  and  fabrication 
level.  Random  soft  errors  can  be  dealt  with  only  by  means  of 
error  correction  circuitry.  This  may  be  expensive  in  terms  of 
hardware. and  access  time  but,  once  it  is  installed,  the  said  cir¬ 
cuitry  will  provide  protection  against  all  kinds  of  discrete  errors. 
( 

^-/Progress  in  the  verification  of  VLSI  devices  is  predicated  on 
the  availability  of  diagnostic  instrumentation.  Three  promising 
techniques  are  being  developed:  1)  Stroboscopic  Scanning 
Electron  Microscopy  2)  the  Zero-Capacitance  Probe  and  3)  Pseudo 
Analog  Testing.  The  first  has  been  on  the  horizon  since  at  least 
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1968  and  that  is  where  it  still  appears  to  be.  The  second  is 
well  known  as  a  relatively  slow  (50  kHz)  electrometer;  we  have 
extended  its  response  into  the  VHF  range.  The  third.  Pseudo 
Analog  Testing,  was  developed  by  us  from  scratch;  it  is  unwieldy 
but  it  often  yields  quantitative  results  where  everything  else 
fails . 

More  attention  must  also  be  given  to  the  matter  of  process 
statistics.  Worst  case  analysis  cannot  be  executed  without  an 
appropriate  data  base.  The  performance  margin  of  dynamic 
memories  depends  very  directly  on  the  uniformity  of  the  fabrica¬ 
tion  technology.  Many  chips  fail  because  of  a  single  soft  bit. 
Clarification  of  the  nature  of  these  failure  would  probably  boost 
the  yield  and  improve  the  rea liability  of  memory  chips. 

Lastly,  much  remains  to  be  done  at  the  system  level.  We  con¬ 
sider  in  appendix  2  error  correction  of  the  Hamming-code  variety 
as  a  particularly  efficient  technique  of  fault- tolerent  design 
and  we  discuss  chip  and  system  architectures  for  reduction  of  the 
risk  of  burst  errors,  but  we  feel,  just  the  same,  that  block- 
oriented  justification  ought  to  be  used  in  VI  SI  electronics. 

-  All  in  all  we  advocate  1)  conventional  testing  for  hard 
errors  attributable  typically  to  stuck-at- faults ,  2)  extensive 
analysis  and  diagnostics  for  non-random  soft  errors  produced  more 
often  than  not  by  crosstalk  manifested  as  Pattern  or  Sequence 
Sensitivity,  and  3)  block-oriented  error  correction  for  random 
errors  induced  most  often  by  alpha  particles  and  cosmic  rays. 
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1.0  Nature  of  the  VLSI-Test  Problem 

Rational  approaches  to  VLSI  testing  are  evolving  disappoin¬ 
tingly  slowly.  Many  test  engineers  still  cling  to  the  "exhaustive 
testing"  notion,  failing  to  realize  that  exhaustive  testing  was 
nothing  but  an  illusion  even  in  the  days  of  discrete  components. 

In  the  VLSI  era,  it  constitutes  a  veritable  absurdity;  the  need 
for  an  "intelligent  alternative  to  exhaustive  testing"  is 
pressing  indeed  [1]. 

What  then  is  the  nature  of  the  quandary?  Some  will  argue 
that  there  are  more  components  but  fewer  access  points  then  in 
the  past;  testing  is,  therefore,  more  difficult  than  it  used  to 
be.  That  is  true  enough,  but  accessibility  can  be  regained 
through  1)  design  for  testability  in  the  Level  Sensitivity  scan 
fashion  2)  the  flip-chip  packaging  technique  [3,4].  IBM  has 
used  both  measures  very  successfully.  One  concludes,  therefore, 
that  accessibility  alone  does  not  present  an  insurmountable 
difficulty.  Nor  does  the  sheer  number  of  components  pose  an 
unsurmountable  barrier;  modularity  and  hiererchical  design  take 
care  of  the  numbers  issue,  provided  only  that  the  integrity  of 
the  interconnections  as  well  as  that  of  the  modules  is  beyond 
doubt,  before  and  after  they  have  been  placed  on  the  chip. 

There  is  more  to  the  last  proviso  than  meets  the  eye.  VLSI 
is  predicated  on  low  power  density  and  that  translates  into  high 
impedance  levels.  VLSI  also  implies  high-packing  density  and  una¬ 
voidably  high  parasitic  elements.  The  combination  of  high  impe¬ 
dances  and  high  parasitic  elements  brings  about  the  real  problem 
of  VLSI. 


i 
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The  intrinsic  problem  of  Large  Scale  Integration  is  cross¬ 
talk.  With  a  million  transistors  on  a  die,  it  is  impossible  -  at 
the  present  state  of  the  art  in  circuit  design  and  wafer  fabrica¬ 
tion  -  to  guarantee  freedom  from  parasitic  interactions.  To  put 
it  differently,  we  can't  be  sure  that  the  funtionality  of  tran¬ 
sistor  r  will  not  be  upset  by  the  data  flow  through  transistor  s. 
As  a  matter  of  fact,  aboundant  evidence  of  such  crosstalk  is 
readily  available  [5].  That  is  why  one  talks  of  Pattern 
Sensitivity,  Sequence  Sensitivity,  Parasitic  Oscillations,  and 
what  have  you. 

Parasitic  oscillations  are  not  discussed  in  this  report; 
contrary  to  popular  belief,  they  do  occur  occasionally  in  digital 
circuits  -  especially  in  ECL,  as  demonstrated  in  figure  1  -  but 
classical  forms  of  this  problem  are  unquestionably  analog  in 

nature  and  are  best  discussed  with  reference  to  analog  systems. 

* 

Our  immediate  interest,  on  the  other  hand,  is  limited  to  unde¬ 
sirable  peculiarities  of  digital  circuits.  Random  Access 
Memories  in  particular.  We  begin  with  a  classification  of  faults 
and  errors,  and  follow  through  with  an  exposition  of  newly  deve¬ 
loped  tools  for  the  detections  of  the  said  peculiarities. 
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Fig.l:  Parasitic  Oscillations  in  Commercial  ECL 
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2.0  Classification  of  Errors 

Our  classification  of  errors  is  intended  to  be  pragmatic  in 
nature.  Presented  in  the  fashion  of  figures  2a,  2b  and  2c,  it  is 
supposed  to  simplify  the  exposition  of  techniques  for  the  detec¬ 
tion  and  correction  of  various  errors.  We  postulate  implicitely 
in  figure  2a  that  a  device  is  either  good  or  bad  -  a  suspect 
device  is  bad  -  but  it  may  be  good  at  one  point  in  time  yet  bad 
at  another;  this  is  our  definition  of  intermittence.  To  assert 
that  a  device  is  intermittent  you  need  the  outcomes  of  at  least 
two  indisputebly  valid  tests,  the  first  showing  the  device  to  be 
bad  and  the  second  showing  it  to  be  good. 

For  the  next  step,  we  separate  hard  errors  from  soft,  the 
latter  being  further  subdivided  into  random  and  non-random 
errors.  Hard  errors  are  manifestations  of  permanent  solid 
faults,  classical  examples  being  furnished  by  "stuck-at- faults" 
[6-9]  attributable  more  often  than  not  to  shorts  and  breaks  in 
interconnect  matrices.  Conversely,  soft  errors  are  obscure 
rather  than  solid.  They  can  be  random  or  non- random  and, 
observed  by  the  untrained  eye,  may  be  mistaken  for  symptoms  of 
intermittent  faults. 

So  much  for  our  nomenclature.  Proposed  IEEE  standards  [10] 
speak  of  hard,  soft  and  medium  errors.  "Hard  errors  are  mani¬ 
fested  by  cells  which  will  not  properly  store  data  under  any  con¬ 
dition  of  test  operation".  The  term  "soft  errors"  is  reserved 
for  "infrequent"  random  errors  -  alpha  particle  induced  errors, 
for  example  -  while  the  "medium"  cognomen  is  supposed  to  cover 
all  errors  other  than  hard  or  soft. 
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Whichever  of  the  two  languages  is  used,  some  difficulties 
will  be  encountered.  Bridgind  faults  (see  figure  2b)  afford  a 
conspicuous  example,  the  word  “bridging"  implying  unbidden  physi¬ 
cal  coupling  of  two  devices.  They  are  solid  enough,  since 
nothing  in  a  circuit  could  be  more  solid  than  a  permanent  short 
of  two  nodes  -  yet  the  symptoms  may  be  confusing.  That  is  why 
figure  2  shows  them  at  the  interface  of  hard  and  soft  faults. 

Also  troublesome  are  "open-gate"  faults  in  MOS  circuits  and  "open 
emitter"  faults  in  bipolar  TTL;  both  may  display  relatively  slow 
parametric  drift  as  well  as  fairly  abrupt  binary  changes. 

In  spite  of  these  complications,  our  classification  of  errors 
is  simple  and  constructive'.  Fundamental  is  the  separation  of  hard 
errors  from  soft  and  just  as  important  is  the  distinction  between 
random  and  non- random  soft  errors.  In  what  follows,  we  speak 
mainly  of  hard  errors,  random  soft  errors  and  non-random  soft 
errors  (figure  2c). 


•r 


2.1  Hard  Errors 


Hard  errors  are  brought  about  by  manufacturing  flaws  which 
are  solid  and  discrete:  a  broken  link  in  component  "i"  precipi¬ 
tates  errors  which  are  observable  at  the  output  of  "i"  and,  more 
specifically,  a  ground-short  at  the  base  of  transistor  j  induces 
errors  observable  at  the  collector  of  j.  Crosstalk  is  specifi¬ 
cally  excluded  from  the  notion  of  hard  errors.  That  is  why  one 
arrives  at  a  simple  and  convincing  scheme  of  testing  for  hard 
errors.  Checking  a  1-by-N  memory  chip  for  hard  errors  one  needs 
to  demonstrate  that: 

1.  there  are  indeed  N  distinct  addresses,  and 

2.  each  cell  can  be  switched  from  a)  an  unknown  state  to 
LOW,  b)  from  LOW  to  HIGH,  and  c)  from  HIGH  to  LOW. 

For  this  we  need 

x  =  31og2N  (1) 

frames  of  N  test  each,  for  a  total  of 

X  =  3Nlog2N  (2) 

trials . 

Increasing  the  memory  capacity  from  N  to  2N  gets  us,  from 
equation  2 

X(2N)  =  3(2N)log2 (2N) 

=  3(2N)[log2N  +  log^] 

=  3(2N)[log2N  +  1] 

Comparing  now  3c  with  2,  we  find  that 


(3a) 

(3b) 

(3c) 


(4) 


X(2N) 

xTnT 


2[1  + 


1 

log2N 


] 


It  follows  without  further  ado  that  X(M)  is  roughtly  proportional 
to  M  if  M  is  reasonably  large  (say  128  or  more).  This  -  linked 
to  the  fact  that  hard  errors  are  speed  and  temperature  indepen¬ 
dent  -  means  that  testing  for  hard  errors  will  never  pose  any 
undue  difficulties,  not  in  LSI,  or  VLSI  or  even  ULSI.  Trouble 
begins  only  when  test  requirements  begin  to  translate  into  poly¬ 
nomial  or,  worse  still,  exponential  functions  of  M;  this  we  will 
see  is  the  rule  in  the  case  of  some  classes  of  soft  errors. 


2.2.0  Soft  Errors 

The  simplified  version  of  our  classification  recognizes  only 

hard  errors  and  soft;  intermittent  and  bridging  errors  are  pushed 

into  the  background  while  errors  camouflaged  by  intentional 

« 

redundance  are  explicitely  excluded  from  our  considerations. 
Consequently,  that  which  is  not  hard  -  i.e.,  solid  and  relatively 
easily  observable  -  necessarily  falls  into  the  soft  category. 

Soft  errors  are,  therefore,  vague  and  elusive,  their  manifesta¬ 
tion  depending  on  temperature,  clock  speed,  environmental  factors 
and  data-flow  patterns. 

To  resolve  the  vagueness  of  soft  errors  we  begin  with  a  basic 
question  of  uncertainty:  'are  the  observed  phenomena  repro¬ 
ducible?  Can  their  existence  be  demonstrated  on  demand?  Are 
they  or  -re  they  not  controllable?  If  the  answers  are 
affirmative,  a  deterministic  regime  prevails;  if  not,  randomness 
is  the  order  of  the  day.  As  shown  in  appendix  1  (reference  1) 
random  errors  can  be  separated  from  their  non- random  counterparts 
in  spite  of  the  obscure  nature  of  soft  faults. 

2.2.1  Random  Soft  Errors 

The  common  concept  of  randomness  applies  to  the  issue  of 
errors.  Errors  are  said  to  be  random  if  they  are  sporadic  and 
cannot  be  reproduced  on  a  one-by-one  basis;  their  time-average 
rate  of  incidence  may  or  may  not  be  controllable.  Separation  of 
random  errors  from  deterministic  mishaps  (appendix  1)  rests  on 
the  observation  that  an  electronic  chip  does  not  remember  inci¬ 
dental  events  indefinitely.  The  functionality  of  a  chip  in  the 
soft-error  sense  (latch-ups  are  hard)  may  depend  on  events  of  the 


last  microsecond,  but  it  is,  surely,  independent  from  data  pro¬ 
cessed  a  year  ago.  Our,  as  yet,  unconfirmed  estimate  places  the 
unintentional  recall  of  Silicon  Integrated  Circuits  at  less  than 
a  minute.  This  being  so,  we  can  examine  the  character  of  every 
malfunction.  Once  an  error  is  detected  we  interrupt  the  regular 
proceedings,  returning  to  the  state  of  60  seconds  ago,  and  then 
replay  the  events  of  the  last  minute. 

The  causes  of  random  soft  errors  are  usually  found  in  the 
environment  including  the  atmosphere  on  one  hand  [11],  and  the 
package  in  which  the  chip  is  housed  on  the  other  [12,13].  The 
effect  of  both  can  be  reduced  significantly  by  encapsulation  of 
the  chip  [14-16],  but  complete  elimination  of  radiation  effects  is 
absolutely  impossible.  One  needs  single-bit  error  correction 
[17]  and  one  needs  to  do  something  about  burst-errors.  Chip  and 
system  architecture  for  the  reduction  of  the  risk  of  bursts  is 
discussed  in  appendix  2  (reference  18).  Testing  of  individual 
chips  for  random  soft  erroirs  is  pointless;  one  depends  entirely 
on  preventive  measures  and  error  correction. 

2 .2. 2.0  Non-Random  Soft  Errors 

Talking  of  non-random  soft  errors  one  refers  willy-nilly  to 
marginalities ,  that  is  performance  marginalities  attributable  to 
circuit  design  flaws  and  fabrication- process  fluctuation.  After 
all,  one  talks  of  devices  which  work  well  under  most 
circumstances;  when  they  do  fail,  they  fail  in  a  deterministic 
fashion  (by  definition),  though  the  actual  mechanics  of  failure 
may  be  quite  obscure.  Obscure  or  not,  the  problem  usually  boils 
down  to  crosstalk  manifested  as  sequence  sensitivity  or  pattern 


sensitivity.  The  former  refers  to  undesirable  phenomena  in  the 
time  domain  while  the  latter  puts  the  accent  on  topological 
peculiarities.  Our  present  discussion  is  confined  to  Pattern 
Sensitivity.  Sequence  Sensitivity  is  examined  in  appendix  3 
(reference  19). 

2. 2. 2. 1.0  Pattern  Sensitivity 

Examples  of  Ordinary  Pattern  Sensitivity  are  easy  to  come  by. 
Consider,  for  example,  the  matter  of  access  time.  This  parameter 
varies  to  a  certain  extent  from  chip  to  chip  and,  more 
importantly,  from  one  address  to  the  next.  There  exists  a  worst 
case  condition,  but  identification  of  that  condition  may  be 
difficult.  What  we  are  faced  with  then  is  a  search  for  a  key  to 
the  "worst  case"  situation.  He  who  has  the  key  can  generate  an 
efficient  test  procedure,  but  he  who  does  not  have  it  will  probe 
in  the  dark  talking  of  Pattern  Sensitivity.  Unduly  optimistic 
specifications  aggravate  the  problem,  provoking  arguments  about 
marginal  units,  inflating  the  cost  of  testing  and  ultimately 
degrading  system  reliability. 

Also  to  be  considered  is  blatant  abuse  of  the  Pattern 
Sensitivity  cognomen.  Switching  noise  on  external  data  buses  and 
supply  lines  constitutes  a  case  in  point.  The  field  engineer 
blames  the  vendor  rather  than  himself  for  the  cause  of  trouble, 
using  Pattern  Sensitivity  as  a  convenient  all-encompassing 
excuse . 

The  term  Essential  Pattern  Sensitivity  (EPAS)  is,  therefore, 
introduced  to  separate  matters  of  processing  and  circuit  design 
from  superficial  issues. 


2. 2. 2. 1.1.0  Essential  Pattern  Sensitivity  (EPAS) 
Essential  Pattern  Sensitivity  can  be  viewed  as  a  form  of 
crosstalk  which  combined  with  performance  marginalities  results 


under  certain  circumstances  in  errors  of  binary  character.  It  is 
said  to  exist  when  a  chip  which  passes  successfully  one  complete 
test  can  be  shown  to  fail  consistently  another  test.  A  chip 
which  passes  a  comprehensive  "checkerboard"  test  though  it  fails 
a  "moving  diagonal"  test  does  display  Essential  Pattern 
Sensitivity  (see  appendix  3  for  Schmoo  plots  of  GALPAT  and  Column 
Disturb  runs).  Details  of  two  examples  of  Essential  Pattern 
Sensitivity  are  given  below,  the  first  to  illustrate  the  fragi¬ 
lity  of  the  Dynamic  RAM  Sense  Mechanism  and  the  other  to  show 
that  plain  and  simple  design  flaws  do  show-up  in  commercial 
devices . 

The  BIT-LINE  circuitry  of  a  16k-RAM  of  the  Mk4116  class  is 

given  in  figure  3;  its  operation  is  described  in  appendix  2  and 

reference  20.  Here  we  note  that  both  sectors  of  the  BIT-LINE  are 

precharged  to  almost  VQD  (say  vDpp)  during  P2  at  the  beginning  of 

a  READ  operation.  The  absolute  value  of  is  immaterial  but 

DDP 

the  two  sectors  (left  and  right)  of  the  BIT-LINE  are  supposed  to 

be  at  the  same  potential.  Some  discrepancy  will  show  up, 

however,  because  of  differences  between  the  precharge  transistors 

Q  ,  and  Q  .  We  refer  therefore  to  and  V-  where  for  now 

pi  pr  D1  Dr 


(V 


D1 


-  VDr>p  “ 


(5) 


AVp  being  referred  to  as  the  precharge  offset.  Another  discre¬ 
pancy  arises  because  of  leakage  through  the  gating  transistors 
( Q j , ect . ) ;  this  descrepancy  is  pattern  dependent.  Roughly 


Fragnent  of  BIT-LINE  Circuitry  of  RAMs  a  la  MK4H6 
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speaking,  current  leaks  through  a  gating  transistor  only  if  the 
top  plate  of  the  corresponding  storage  capacitor  is  at  a  LOW 
pc  .ential.  Thus  with  n  LOWs  on  the  left  and  m  LOWs  on  the  right, 
there  will  be  a  leakage  offset  AV^  of  magnitude 

AV£  =  (m  -  n)AVii  (6) 

where  AV^  stands  for  the  average  leakage-offset  of  single  cell. 

Yet  another  discrepancy,  denoted  by  AVQ,  arises  because  of 
Sense-Amplifier  offsets  and  Gating-Transistor  inconsistencies. 

The  total  error  (i;Ve)  amounts,  therefore,  to 

4S  “  "d!  -  Sr>e  <7‘> 

=  AV  f  AVq  +  (m  -  n)AVii  (7b) 

The  nominal  signal  voltage  produced  in  response  to  a  left-side 
address  is  either 


A  V  ( H )  =  = - t~F~  vnn 

sg  cbr  +  crr  DD 


(figure  4) 


C 

rr 

—  V 
cbb  DD 


Av  ,L)  =  CsjCbr  ~  SrrSjl _ 

Vsg(L)  ~  TC^  +  C sjMCbr  +  Crr)  VDD 


c  .  -  c 

sj  rr 


where 


bb  ■  cbl  +  Sj  *  Sr  +  Crr  '  Si  *  Sr 

Parasitic  capacitance  of  the  bitline  (see  figure  4) 


•  Capacitance  of  storage  cell  j 


Fig.  4:  Significant  Capacitances 
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which  the  first  [AVp(i)]  and  third  [ A VQ ( i )  ]  characterize  an 
entire  BIT  LINE,  the  fourth  {AVc(i  left)}  is  associated  either 
with  one  sector  of  a  BIT-LINE  (e.g.,  left  sector  of  BIT-LINE  "i") 
or  an  individual  storage  capacitor  {AVcs(j,i)}  and  the  third 
( AV^ )  is  pattern  sensitive.  The  worst  case  of  A\^  occurs  in  pat¬ 
terns  equivalent  to  the  following  example: 

Location  of  the  bit  to  be  interrogated:  Lower  Sector,  BIT-LINE  "i" 
Voltage  at  location  to  be  interrogated:  HIGH 
Voltage  at  all  other  locations  of  the  Lower  Sector  of 

BIT-LINE  " i " :  LOW 

Voltage  of  all  locations  of  the  Upper  Sector  of  BIT-LINE  "i":  HIGH 
Leakage  error:  AV^(w.c.)  =  (/N  -  1)AV^  (14) 

The  pattern- sensitive  leakage  component  may  but  need  not  be 
significant;  everything  in  error  analysis  boils  down  to  the 
signal- to- noise- ratio  and  the  relative  magnitudes  of  the  indivi¬ 
dual  components  of  noise.  Both  these  issues  are  explored  in  more 
detail  in  section  4.  There  is  abundant  evidence  to  show  that  the 
above  discussed  Pattern  Sensitivity  is  highly  significant  at  ele¬ 
vated  temperatures  [22].  There  is  nothing  one  can  do  about  it  in 
short  order.  The  user  should,  however,  incorporate  the  worst 
case  patterns  into  his  hard-error  high- temperature  tests. 

For  an  entirely  different  example  of  Pattern  Sensitivity  we 
turn  to  figure  5  and  refernece  23.  As  is  always  the  case  with 
dynamic  RAMS,  one  can  READ,  WRITE  or  REFRESH.  The  problems  to  be 
described  are  associated  with  the  REFRESH  operation.  In  this 
context  consider  two  cycles  numbered  "n-1"  and  "n”  respectively. 

[ 

. 
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First,  READ  storage  cell  j  at  cycle  time  "n-l"  and  next 
READ/REFRESH  cell  i  at  time  "n".  Finally,  consider  the  sharing 
of  charge  by  the  capacitances  of  nodes  2  and  3  to  observe  that 
the  outcome  of  the  refresh  operation  on  cell  i  may  depend  on  the 
contents  of  cell  j.  This  constitutes  a  case  of  Essential  Pattern 
Sensitivity  with  a  trace  of  Sequence  Sensitivity. 

Next,  let  data  be  written  into  any  cell  of  time  "n-l"  and  let 
any  other  cell  be  refreshed  at  time  “n" .  Considering  once  more  the 
parasitic  capacitances  of  nodes  1  and  2,  it  becomes  apparent  that 
the  outcome  of  the  refresh  operation  of  time  "n"  depends  quite 
improperly  on  events  which  took  place  at  time  "n-l",  and  this 
gives  us  a  transparent  example  of  Essential  Sequence  Sensitivity. 

We  refer  above  to  ESSENTIAL  Pattern  Sequence  Sensitivities  be¬ 
cause  we  speak  there  of  malfunctions  which  are  directly  related  to  cir¬ 
cuit  design  problems,  PASS  GATE  complication  in  particular.  Availability 
of  the  PASS  GATE  in  addition  to  the  regular  LOGIC  GATE  constitutes  a 
valuable  asset  of  the  MOS  transistor  [24,25],  well  worth  the  price  of 
the  complications  which,  by  the  way,  can  be  resolved  quite  easily. 

One  could  argue,  for  that  matter,  that  most  Essential  Pattern 
and  Sequence  Sensitivity  mishaps  are  inexcussable .  EPASS  can  be 
avoided  or  in  the  worst  case,  eliminated  in  redesign.  One  needs 
however  adequate  diagnostic  tools  as  well  as  an  adequate  data 
base;  diagnostic  tools  which  facilitate  node-by-node  testing  of 
VLSI  chips  and,  a  data  base  which  offers  statistical  fluctuations 
as  well  as  average  values  of  process  parameters. 

Unchecked,  Essential  Pattern  and  Sequence  Sensitivity  begets 
catastrophics  consequences.  Since  the  functionality  of  the  hard¬ 
ware  may  depend  on  the  very  data  which  is  being  processed,  test 
requirements  become  utterly  unrealistic. 
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2. 2. 2. 1.1.1  The  Incidence  of  Errors  as  a  Function  of  the 
Data  Which  is  Being  Processed 

A  fault  will  only  be  detected  by  an  appropriate  set  of  test 
vectors.  If,  for  example,  input  A  in  the  AND  GATE  of  figure  6  is 
physically  shorted  to  ground  (stuck  at  zero)  nothing  abnormal 
will  be  detected  at  the  output  terminal  by  means  of  test  vectors 
A  =  0,  B  =  0;  A  =  1,  B  =  0;  or  A  =  0,  B  =  1.  The  existence  of  a 
s tuck-at-zero  problem  can  be  disclosed  only  by  the  A  =  1,  B  =  1 
vector.  Speaking  of  combinatorial  logic  and  testing  in  general, 
one  can  assert  that  a  fault  which  changes  the  transfer  func¬ 
tion  of  a  circuit  from 


fc  “  fl(xl'x2 . xn) 

to 


fc  =  f 2  ( x2  '  x2 . 


will  be  divulged  by  test  vector 


a  -  {alfa2» . • • *an^ 

=  ( X.  /  X _  ,  .  .  .  ,  X  } 

lz  n 


only  if 


f1(a)  +  f2(a)  =  1 


(14a) 


(14b) 


(15a) 

(15b) 


(16) 


Thus,  manifestation  of  faults  is  always  data  dependent  in  the 
sense  that  appropriate  test  vectors  must  be  applied  if  a  par¬ 
ticular  fault  is  to  be  disclosed.  There  is  however  more  to  data- 
dependence  than  the  matter  of  test  vectors.  As  we  have  shown, 
the  very  functionality  of  a  ciruit  may  depend  on  the  data  which 
is  being  processed  by  it:  a  device  which  passes  an  exhaustive 


i 


(as  far  as  hard  errors  are  concerned)  GALPAT  test  may  well  be 
rejected  by  a  COLUMN  DISTURB  test.  What  is  exhaustive  in  the 
hard-error  domain  is  by  no  means  exhaustive  in  the  soft-error 
context.  Therein  lies  a  crucial  distinction  between  hard  and 
soft  errors.  Essential  Pattern  and  Sequence  Sensitivity  is  the 
principal  cause  of  this  distinction. 

From  now  on  we  will  simply  say  that  hard  errors  are  data 
independent  while  their  soft  counterparts  do  depend  on  the  flo.. 
of  data  in  terms  of  both  Pattern  and  Sequence  Sensitivity. 
Furthermore  the  word  "data",  as  used  here,  refers  to  addresses, 
operands  and  unclassified  alphanumerics . 
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2. 2. 2. 1.1. 2  Numerical  Assesment  of  Pattern  Sensitivity 

Overlooking  issues  of  sequence  sensitivity  we  state  that  the 
contents  of  a  binary  N-bit  memory  can  be  arranged  into 


S  =  2N  (17a) 

=  10* 3N  (17b) 


patterns . 

To  check  all  of  these,  in  the  case  of  a  1-by-N  chip,  one  must 
go  through  at  least 


n  =  nh 


=  NxlO 


.  3N 


(18a) 

(18b) 


test.  Operating  at  the  rate  of  a  million  trials  per  second, 

9  13 

i.e.,  3.6  x  io  trials  per  hour  or  about  3  »  10  test  per  year, 

we  would  need 

X(64k)  =  n/3xl013  (19a) 

=  2xl019'000  (19b) 


years  to  test  a  64k-RAM .  In  one  hour  we  can  test  exhaustively 
for  Pattern  Sensitivity  a  memory  of  no  more  than  a  total  of  40 
bits.  This  is  the  practical  limit  in  exhaustive  Pattern  Sensitivity 
Testing  even  if,  by  exclusion  of  Sequence  Sensitivity,  we  limit 
ourselves  to  the  simplest  form  of  the  pattern  problem. 

This  does  not  mean  that  one  does  not  test  for  Pattern 
Sensitivity.  All  we  need  to  give  up  is  the  concept  of  exhaustive 
testing.  We  analyze,  diagnose  and  eventually  test  in  accordance 
with  our  worst-case-6cenerio. 
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3.0  Numerical  Summary  of  Test  Requirements 
As  outlined  in  section  we  need  at  least 

fi(h.e. )  =  3Nlog2N  (21) 

to  check  for  hard  errors.  Allowing  for  crosstalk  of  the  nearest- 
neighbor  type,  one  boosts  the  test  requirements  to 

£2(n.n.)  =  24Nlog2N  (22) 

In  both  of  the  above  tests  there  is  essentially  a  linear  rela¬ 
tionship  between  the  test  requirements  and  the  capacity  of  the 
memory  chip.  However,  a  three-halves  relationship  results,  i.e., 

n(c./N)  =  kjN3/2  (23) 

if  crosstalk  between  all  bits  in  a  row  or  a  column  or  on  a  diago¬ 
nal  is  consider.  Simple  crosstalk  between  any  two  bits  leads  to 
the  N- SQUARE  law, 

£}(c.N)  =  k2N2  (24) 

and  finally,  Pattern  Sensitivity  brings  forth  the  exponential 
law, 

fi(p.s)  =  Nx2N  (25) 

Needless  to  say,  practical  testing  of  VLSI  devices  must  be 
limited  to  the  linear  cases.  That  means  hard  errors,  proximity 
effects  and  identified  worst  case  scenarios  of  the  Pattern  and 
Sequence  Sensitivity  category. 
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4.0  Diagnostic  Tools 

High  packing  density  is  predicated  on  low  power  density  and 
that,  in  turn,  leads  to  high  impedances  and  susceptibility  to 
external  loads.  VLSI  circuits,  MOS  devices  in  particular,  cannot 
be  tested  by  means  of  conventional  test  techniques.  New  tools 
are  required  and  three  different  types  are  becoming  available. 

The  first,  the  Strotoscopic  Scanning  Electron  Microscope  (26,27) 
is  being  developed  by  Bell  Laboratories,  IBM,  Siemens  and 
Fujitsu.  The  projected  price-tag  is  $200k,  but  the  long-awaited 
instrument  is  still  not  available.  The  second,  a  microprobe 
characterized  by  a  very  low  input-capacitance  has  been  available 
for  some  time  in  various  audio- frequency  versions  (28,29);  we 
have  extended  the  response  of  these  probes  into  the  VHF/UHF 
range.  The  cost  of  our  probe  falls  below  the  $500  mark.  The 
third,  a  technique  named  Pseudo  Analog  Testing  has  been  developed 
by  us  from  scratch. 


Dealing  with  large  circuits,  one  can't  overemphasize  the 
importance  of  power  economy;  the  concept  of  VLSI  is  predicated  on 
low  power-density:  with  a  million  transistors  on  the  chip  just 
one  micro-ampere  per  transistor  results  in  5  watts  at  5  volts,  and 
that  calls  for  water  cooling,  an  inacceptable  demand  for  general- 
purpose  work,  current  practice  in  low-density  ECL  and  3D  not 
withstanding.  Consequently  there  is  a  trend  towards  low-voltage 
concepts  (IIL,  ISL,  etc.)  in  the  Bipolar  World  and  sub-micro¬ 
ampere  techniques  in  MOS.  Our  MICROPROBE  is  designed  for  MOS 
work. 

Where  currents  are  low,  impedances  are  high.  The  BIT-LINE  of 
a  16k  dynamic  RAM  is  equal  to  a  stand-alone  1  pF  capacitor  over  a 
major  part  of  an  operating  cycle.  Conventional  oscilloscopes  are 
useless.  We  need  a  probe  whose  input  impedance  is  no  larger  than 
.1  pF  and  we  need  a  response  of  100  MHz  to  boot.  Difficult  as 
these  requirements  may  be,  our  probe  foots  the  bill. 

The  original  schematic  diagram  of  our  probe  is  shown  in 
figure  7a.  What  we  have  there  is  complete  follower:  the  emitter 
follows  the  gate  in  the  usual  fashion  and  the  collector  follows, 
in  turn,  the  emitter,  via  04  and  the  Zener  diode.  Thus  one 
gets  neutralization  of  the  collector-gate  capacitance  as  well  as 
the  emitter-gate  capacitance.  A  discrete  transistor  breadboard 
of  figure  7a  displays  an  input  capacitance  of  .8  pF  and  a  band¬ 
width  of  almost  100  MHz  at  a  total  current  drain  of  20  mA. 

Figure  7b  shows  the  latest  version  of  our  MICROPROBE.  Two 
items  merit  attention:  1)  the  GATE-TO- SUBSTRATE  capacitance  is 
highlighted  and  neutralized  while  2)  the  external  base  resistances 


seen  respectively  by  source  transistor  Q2B  as  well  as  sink  tran¬ 
sistors  Q3B  and  Q3C  are  reduced  -  relative  to  figure  7a  by  means  of 
emitter- follower  transistors  Q7  and  Q8.  SPICE  analysis  of  this 


circuit  -  with  transistors  whose  capacitance  (CgE,  CBC  and 
CjJCASE )  are  each  -suggest  a  potential  voltage  bandwidth  of 
400  MHz  with  a  low-frequency  input  -  capacitance  of  0.02  pF. 


The  bandwidth  is  crucially  important.  To  develop  an  intuitive 


understanding  of  the  problem  consider  a  fixed-delay-system 
(figure  8a)  and  its  response  to  a  voltage- step  input.  Since 
unity  gain  is  assumed  (K  =  1.0),  the  net  charge  transfer  is  zero. 


Nevertheless,  substantial  current- flow  is  induced  in  one  direc¬ 


tion  by  the  input  switch  and  later  in  the  reverse  direction  by 
the  delay  switch.  A  plot  of  the  SPICE  run  confirms  (figure  8b) 
these  speculations  very  nicely. 

Figure  9  shows  bit- line  events  on  a  U  of  M  RAM,  recorded  by 
means  of  a  probe  built  to  schematic  diagram  7a  with  provisions 
for  case-to-gate-capacitance  neutralization  per  7b.  The  probe 
works.  We  can  observe  bit- line  events  by  means  of  relatively 
inexpensive  instrumentation.  As  expected,  switching  noise  is 
conspicuous.  The  glitches  which  follow  the  precharge  pulses  are 
certainly  objectionable.  Work  on  this  topic  continues  and 
Ms.  Jiang  Jialin  is  getting  ready  to  write  a  major  article  on 
Ultrafast-Microprobes . 
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4.2  Pseudo- Analog-Testing  (PAT) 

The  technique  of  Pseudo  Analog  Testing  has  been  conceived  and 
developed  by  us  to  meet  the  pressing  demand  for  a  tool  which 
would  truly  perform  the  functions  of  an  Ultra-Fast  Zero-Input- 
Capacitance  Voltage-Follower.  It  emerged  in  the  course  of 
Gedenken  Experimentation  with  Analog  Testing  of  Digital  Devices, 
another  idea  formulated  by  us.  Details  of  PAT  are  given  in 
appendix  4  and  reference  30.  Here  we  merely  point  out  that  the 
concept  of  Pseudo  Analog  Testing  can  be  put  to  work  in  many  ways, 
some  of  them  non-destructive  in  nature.  We  have,  for  example, 
monitored  the  "best-case"  refresh  time  of  individual  cells  to 
obtain  absolute  and  relative  data  on  the  leakage  currents  of 
storage  capacitors.  There  is  also  dynamic  PAT (31),  the  original 
form  of  which  -  proposed  by  us  in  reference  19  (appendix  3)  -  has 
been  popularized  by  Teradyne  under  the  label  of  "Bump,  Bounce  and 
Rebound  Testing"  (32). 

Pseudo  Analog  Testing  is  limited  in  scope,  but,  in  certain 
cases,  it  yields  credible  quantitative  results  where  other 
methods  fail.  No  specialized  equipment  is  required;  in  contrast 
to  Stroboscopic  Scanning  Electron  Spectroscopy  available  only  at 
the  likes  of  IBM  and  Bell  Telephone,  Pseudo  Analog  Testing  is 
within  the  reach  of  everybody.  Used  as  in  section  5.1,  it  is 
destructive  but  the  loss  of  a  chip  is  the  least  of  our  concerns 
when  VLSI  diagnostics  are  attempted. 
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5.0  The  0  of  H  16k- RAM 

The  U  of  M  16k-RAM  (figure  10)  was  designed  and  processed  in 
cooperation  with  the  Microcircuit  Division  of  the  Control  Data 
Corporation.  This  state-of-the-art  Oxide  Isolation  NMOS  chip 
facilitates  MICROPROBE  as  well  as  PAT  observation  of  circuit 
peculiarities  and  process  statistics. 

There  are  the  usual  128  rows  and  128  columns  but  the 
reference  capacitors  come  in  8  different  sizes.  Twenty-micron- 
square  Aluminum-Pads  over  Field-Oxide  attach  to  the  individual 
section  of  bit  lines  through  five- micron- square  contact  windows. 
Provisions  are  also  made  for  easy  connection  of  PAT  poteniometers 
to  the  precharge  transistors  (nodes  Xg  ,  XL  and  XR  in  figure  3). 

The  U  of  M  chip  is  available,  for  qualifying  research  work, 
from  Mr.  Charles  T.  Naber,  Research  Manager,  Control  Data 
Corporation,  Minneapolis,  Minnesota  55440. 
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5.1  The  Fault  Profile  of  the  U  of  M  RAM 

The  logic  diagram  of  the  Raster-Scan  Memory-Exerciser  deve¬ 
loped  by  us  for  Pseudo- Analog-Testing  is  given  in  figure  11. 
Though  the  scanning  system  always  works  its  way  through  all 
16,384  location,  the  display  can  be  set  to  show  either  the  entire 
chip  or  just  one  quadrant.  The  optical  resolution  of  the  two 
settings  are  displayed  in  photo  1  of  figure  12:  individual  bits 
are  clearly  discernible  in  the  single  quadrant  pictures;  full- 
chip  images  give  a  good  presentation  of  row  and  column  events. 

For  another  preliminary  observation  turn  to  photos  12.5  and  12.6. 
Chip  number  5  was  shot  twice,  once  at  5  p.m.  and  again  at  7  p.m. 

of  5/17/81.  The  two  pictures  are  identical,  as  far  as  one  can 

tell,  demonstrating  that  Pseudo  Analog  Testing  does  deliver 
reproducible  results. 

The  devices  whose  performance  is  displayed  in  photos  12.1 
through  12.13  are  regular  production  chips  (not  U  of  M  RAMs). 
Examining  their  behavior  we  look  -  to  begin  with  -  for  average 
performance  margins.  With  a  HIGH  voltaa*'  (nominally  12V  on  the 
top  plate) ,  errors  in  the  form  of  FALSE  ZEROS  set  in  on  the  lower 
sector  (left-hand  side  of  the  BIT-LINE  in  figure  12.10)  at  40  mV 
with  a  mean  of  about  110  mV;  complete  switch-over  occurs  at  80  to 
160  mV. 

More  details  appear  in  table  1.  The  results  are  consistent 
on  one  hand  and  surprising  on  the  other.  The  "worst  case"  per¬ 
formance  margin  is  40  mV,  The  mean  is  to  100  mV  in  the  set  which 

comprises  chips  number  1,3, 5,6,7,  and -11,  made  by  manufacturer  A. 
Chip  Ml,  processed  by  vendor  M  has  a  mean  of  about  200  mV.  Note 
that  the  worst  case  is  established  at  40  mV  on  chip  7  by  just  one 
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cell;  all  other  cells  on  the  same  chip  -  all  16,383  of  them  - 
have  performance  margins  that  are  at  least  twice  as  high.  Again, 
on  chip  three  (figure  12.4),  the  worst  case  of  60  mV  is  produced 
by  just  one  cell  (different  than  on  chip  7)  while  all  other  cells 
work  to  and  beyond  100  mV. 

The  wide  distribution  of  performance  margins  is  disturbing; 
it  leads  to  soft  errors  and  must,  therefore,  be  corrected.  In 
any  case,  why  should  one  cell  be  much  worse  then  the  rest?  We 
expect  that  minute  "soft"  (non- catastrophic)  process  defects 
-  unduly  thin  Gate  Oxide,  for  example  -  are  at  the  bottom  of  this 
puzzle.  Investigations  of  this  phenomenon  will  be  continued  in  a 
MASTER  THESIS  of  James  E.  Broughton  from  SPERRY-UNIVAC  in 
St.  Paul,  Minnesota. 

To  look  at  a  "soft  defect"  in  reference  cells  turn  to  pho¬ 
tograph  12.7  (chip  #6- ZEROS).  The  whole  lower  (left-hand)  sector 
of  a  bit  line  has  an  unduly  low  performance  margin.  There  is  almost 
definitely  a  flaw  in  the  components  associated  with  the  right- 
hand  storage  cell;  a  Sense- Amp  offset  problem  would  have  shown 
up,  for  examples,  as  a  bad  sector  on  the  left  for  FALSE  ONES  and 
an  equally  bad  right  sector  of  the  same  BIT-LINE  for  FALSE  ZEROS, 
as  in  figures  12.12  and  12.13. 

Apart  from  the  above  noted  exceptions,  the  spread  of  the 
switch-over  voltage  is  contained  to  20  mV.  Concurrently,  the 
reproducibility  evidence  submitted  with  regard  to  chip  #5  in  pho¬ 
tographs  12.5  and  12.6  demonstrates  that  the  pertinent  noise 
levle  is  relatively  low.  One  can  assume,  therefore,  that  the 
20  mV  straggle  is  attributable  to  variations  in  storage 
capacitance.  With  a  mean  of  100  irtV,  we  get  a  typical  variation 
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of  about  20%.  This  is  acceptable,  but  the  aforementioned  "worst 
case"  is  not.  As  already  noted,  the  wide  dispersion  of  capaci¬ 
tance  values  will  lead  to  performance  marginalities  which  preci¬ 
pitate  soft  errors.  Two  action  items  are  called  for: 

1)  Reduce  the  parasitic  capacitance  of  the  BIT-LINE  to 
improve  the  overall  performance  margin,  and 

2)  Clean-up  the  process  to  reduce  cell-to-cell  variations 
of  capacitance. 
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Fig.  11:  Raster-Scan  Memory-Exerciser 
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Fig. 12.1;  Resolutions  of  the  BIT-MAP  display 
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6.0  Reliable  Chips  and  Systems 

System  reliability  begins  at  the  '‘materials"  level.  In 
contrast  to  past  practice,  we  want  epi-wafers,  rather  than  their 
homogeneous  counterparts,  in  order  to  immunize  our  products  as 
far  as  possible  against  malefactions  of  alpha  particles  [33].  We 
also  want  a  low  dislocation  density  to  assure  low  leakage,  but 
most  important  of  all  is  uniformity,  since  poor  uniformity 
invariably  leads  to  Pattern  Sensitivity,  the  deathknell  of 
reliability. 

Processing  must  be  uniform  too:  cell  to  cell,  chip  to  chip, 
wafer  to  wafer  and  run  to  run.  Statistical  straggle  reduces  per¬ 
formance  margins  and  thus  leads  to  hard  faults  or,  worse  still, 
to  soft  errors.  The  matter  of  yield  is  significant.  Low  yield 
implies  poorly  centered  processing,  or  excessive  variations  or 
inadequate  performance  margins.  Either  one  of  these  will  result 
in  low  yield  as  well  as  high  Pattern  and  Sequence  Sensitivity. 

In  circuit  design  and  chip  layout,  most  dangerous  are  minor 
flaws.  Routine  runs  of  CAD  programs  will  never  catch  them.  One 
needs  to  develop  the  technique  of  adversary  analysis  -  a  tech¬ 
nique  aimed  at  the  identification  of  worst  case  combinations  - 
and  one  also  needs  to  establish  a  proper  data  base  for  whatever 
process  one  happens  to  work  with. 

In  architecture,  crucial  is  the  reduction  of  the  risk  of 
burst  errors.  The  chip  as  well  as  the  entire  system  must  be 
designed  with  this  thought  in  mind.  One  already  operates  on  multiple 
operands  for  the  sake  of  low  overheads  [appendix  2]  but  one 
should  go  a  step  further  and  consider  block-organization  of 
memories  for  the  explicit  purpose  of  effective  error  correction. 


Altogether  one  needs  good  epi,  uniform  processing,  wide  per¬ 
formance  margins,  effective  error  correction  and  penetrating 
diagnostics.  More  work  must  be  done  in  the  area  of  process 
satisfies ?  figures  12.1  through  12.13  demonstrate  that  very 
eloquently.  Also  needed  are  means  for  the  quantization  of  pro¬ 
cess  integrity  and  design  quality.  Last  but  not  least,  there  is 
a  call  for  diagnostic  tools  such  as  the  Scanning  Stroboscopic 
Electron  Microscope  and  our  own  Microprobe  or,  as  long  as  nothing 
better  is  available,  the  Pseudo  Analog  Test  Technique. 
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7.0  Conclusion 

Testing  should  not  be  treated  as  an  objective  in  its  own  right. 
Verification  of  performance,  rather  than  testing,  is  the  real 
objective.  It  won't  do  to  argue  that  we  must  test  to  be 
"certain";  there  is  no  certainty,  not  ever.  We  must  confine  our¬ 
selves  to  statistical  expectations,  asking  for  appropriately  high 
confidence  levels  and  no  more.  In  VLSI  one  runs  exhaustive 
general  purpose  tests  for  hard  errors  and  a  few  highly  customized 
tests  for  non-random  soft  errors.  The  latter  operation  is  far 
from  being  exhaustive.  One  deals  with  crosstalk  in  the  form  of 
Pattern  and  Sequence  Sensitivity.  Anything  in  excess  of  30  bits 
(sic)  leads  to  absurd  situations. 

To  curb  Pattern  and  Sequence  Sensitivity,  we  need 
KNOWLEDGE  above  all;  knowledge  of  circuitry  as  well  as  processing 
and  not  just  superficial  knowledge  at  that.  Pattern  and  Sequence 
Sensitivities  are  a  matter  of  marginalities ;  to  eliminate  them, 
we  must  understand  the  subtleties  of  circuitry  and  processing  in 
order  to  identify  worst  case  situations  precipitated  by  theoreti¬ 
cal  circuit  problems  on  one  hand  and  the  very  real  process  pecu¬ 
liarities  on  the  other. 

The  need  for  a  data  base  complete  with  statistical  infor¬ 
mation  will  become  even  more  pressing  as  we  progress  from  the 
present  five  micron  features  to  submicron  dimensions. 
Reproducibility  of  characteristics  does  depend  distinctly  on 
feature  size;  we  may  want  to  trade  size  against  matching.  There 
exist  good  reasons  for  fairly  large  Sense  Amplifiers  in  otherwise 
submicron  memories. 
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Abstract 

The  concept  of  self-learning  testers  is  explored 
Vlth  reference  to  the  detection  of  certain  classes 
of  toft  failures.  appropriately  organized  ma- 

Chlne  acquires  testing  "know-how",  while  exer¬ 
cising  theories  with  a  virtually  endless  sequence 
Of  pseudo  random  test  veccors.  Verification  and 
sorting  of  failures  is  straightforward,  but  imag¬ 
inative  compression  of  the  acquired  information  is 
mandatory.  The  efficacy  of  the  tester  is,  indeed, 
predicated  on  the  notion  of  the  "time  trace"  of  a 
•aquence  of  test  vectors. 


Introduction 

The  testing  of  large  monolithic  circuits  Is  a 
multifarious  affair.  Investigating  hard  fail¬ 
ures,  one  has  to  consider  the  recently  identified 
''Stuck  at  tristate”  problem  (1),  just  as  much  ns 
the  old  "stuck  at  one"  and  "stuck  at  zero"  prob¬ 
lems  (2,3).  But  this  gives  just  a  foretaste  of 
complications  encountered  when  probing  for  soft 
failures.  With  soft  failures  in  the  offing,  LSI 
devices  cannot  be  tested  exhaustively;  willy-nilly, 
they  must  be  tested  intelligently  (8,9).  Indeed, 
every  test  must  serve  a  distinct  worthwhile  pur¬ 
pose  If  a  modicum  of  efficiency  is  to  be  achieved. 
Nonetheless,  there  is  room  for  both  deterministic 
find  Monte  Carlo  techniques.  The  former  are  for 
anticipated  weaknesses,  whereas  the  latter  are 
brought  into  play  when  one  attempts  to  detect 
unforeseen  flaws.  Random  errors,  attributable  to 
such  perturbations  as  extraneous  noise  or  nuclear 
radiation  (10,11),  are  best  monitored  by  a  com¬ 
bination  of  deterministic  and  statistical  tech¬ 
niques. 

Among  the  conspicuous  causes  of  soft  failures  are 
design  deficiencies  (A, 5),  processing  flaws  (6,7), 
find  specifications  malapropisms  (8).  This  is  why 
the  synthesis  of  test  programs  for  LSI  devices 
Should  be  preceded  by  in-depth  studies  of  cir¬ 
cuitry,  topology  and  processing  (9).  Admittedly, 
however,  while  it  takes  time  to  analyze  a  new 
RAM,  experimental  work  should  commence  at  the 
earliest  opportunity.  Hence  comes  the  first 
argument  for  self-learning  testers  as  machines 
Which  can  be  put  to  work  without  procrastination. 

A  second,  entirely  different,  inducement  arises 
from  the  autonomous' nature  of  self-learning 
tenters:  being  independent  of  human  intelligence, 
eclf-lcarnlng  machines  are  likely  to  reveal  that 
Which  man  has  overlooked. 


The  term  "self-learning"  need  not  conjure  a  spec¬ 
ter  of  automatons  or  androides  (12,13);  it  is  not 
proposed  to  bestow  "artificial  intelligence"  upon 
the  tester.  On  the  contrary,  the  sup' — sedly 
ubiquitous  need  for  intelligence  is  being  denied; 
intelligence  is  not  considered  to  constitute  a 
sine  qua  non  prerequisite  of  either  learning  or 
decision  making.  The  argument  is  advanced  that  a 
system,  equipped  with  an  ample  memory  and  a  sen¬ 
sible  sorting  mechanism,  can  acquire  "know-how" 
which  will  enable  it  to  arrive  at  sound  decisions 
most  of  the  time;  it  is  contended  that  such  a 
system  will  keep  track  of  repetitive  phenomena 
£*nd  will  separate  the  reproduciDle  from  the 
random,  quite  efficiently. 

The  Hardware 

The  functional  components  of  the  tester  are  shown 
in  figure  1.  There  is  a  control  and  processing 
unit,  a  multi-phase  clock,  a  random-number  gener¬ 
ator,  an  up-down  counter,  and  there  are  three 
distinct  memories,  in  addition  to  the  DUX.  The 
main  memory  has  R  sectors  to  accommodate  as  many 
fatal  sequences,  that  is,  sequences  which  dis¬ 
qualify  at  least  one  DUT.  Each  sector  has  two 
blocks,  a  regular  RAM  for  initial-state  data  and 
a  FIFO  memory  for  test  vectors.  The  reference 
memory  is  supposed  to  be  fail-proof,  by  virtue  of 
both  majority  voting  and  error  correction  redun¬ 
dance.  The  results  memory  remembers  the  failure 
record  of  individual  PUTS.  The  up-dovn  counter 
supplies  sequential  addresses  on  one  hand,  and  a 
backtrack  capability  on  the  other.  The -purpose 
and  construction  of  the  other  modules  Is  self 
explanatory,  but  the  architecture  of  the  random 
number  generator  docs  deserve  some  elaboration. 

The  random  number  generator  is  built  around  a 
A5-bit  shift  register.  Modulo  two  linear  feedback 
is  employed  to  produce  "maximum  length"  sequences 
of  pseudo  random  numbers  (15,16).  Running  at  a 
rate  of  a  million  words  per  second,  the  generator 
will  furnish  a  one  year's  supply  of  distinct 
A5-bit  numbers.  In  addition  to  the  feedback  taps, 
there  arc  m+1  output  terminals,  one  for  the  random 
"data"  and  ra  for  the  random  address  under  which 
the  data  gets  stored.  The  number  of  address  bits 
is  invariably  smaller  than  A5.  For  this  reason, 
some  addresses  will  reappear  and,  better 
still,  will  reappear  in  different  order  at 
different  times.  This  fulfills  a  randonness 
condition  that  lias  been  overlooked,  apparently, 
by  the  cited  tolerances.  Vet  another  increment 
of  entropy  comes  from  the  truly  random  initial 
state  produced  by  means  of  the  hardware  shown  in 
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Figure  I.  Component.,  of  a  Self-Learning  Tester 

the  upper  part  of  figure  2.  ;  All  in  all  ue  have  in 
figure  2  a  generator  which  produces  an  abundance 
of  credibly  random  sequences. 
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Symbols  and  Numbers 

Our  DUT  is  a  "one  by  M"  RAM,  with  M  taken  as 
65,536  whenever  specific  numbers  are  called  for. 
There  is,  accordingly,  one  data-in  pin,  one  data- 
out  pin,  and  there  are 

m  -  log^M  -  14  (1) 

address  pins.  The  same  applies  to  the  Reference 
Memory,  as  it  must,  if  the  DUT  and  the  Reference 
device  are  to  be  operated  in  parallel. 

The  Main  Memory  is  fairly  large  and  complex. 

There  ate  R  sectors  and-  there  are  two  Blocks  in 
each  sector.  Each  Pattern  Block  has  a  capacity 
of  M  bits,  just  as  the  DUT.  Each  Vector  Block  is 
a  FIFO  memory  which  can  store  n  words  of  m+1  bits. 
The  number  R  gives  the  siz~  of  a  test  batch:  R 
devices  can  be  tested  in  or.e  uninterrupted  run  of 
the  machine.  The  number  n  is  the  yet  to  be 
discussed  "trace-length"  af  a  failure.  The 
capacity  of  the  Main  Memory  is 

CMM  -RxM+Rr.  nx  (m-1)  (2) 

The  Results  Memory  keeps  track  of  the  statistics 
of  the  self-learning  exercise;  it  remembers,  for 
example,  that  DUT  *3  failed  at  least  one  test  In 
sequence  01 ,  one  In  sequence  -121,  and  one  in 
sequence  1*39.  It3  capacity  is 

CRM  -  2  x  R  x  R  bits  (3) 

There  are  two  bits  for  each  of  the  R  tests  per¬ 
formed  on  each  of  the  R  devices. 

The  Range  of  Soft  Failures 

The  issue  of  soft  errors  arises  only  where  the 
integrity  of  a  machine  happens  to  be  data  depen¬ 
dent.  For  example,  an  adder  is  said  tc  exhibit  a 
soft  failure,  if  it  consistently  fouls  up  the 
addition  of  3.87  and  6.45.  though  it  works  cor¬ 
rectly  with  all  other  numbers.  As  one  would 
expect,  a  machine  may  manifest  core  than  one  soft 
error,  and  such  errors  may  be  either  related  or 
independent.  Soft  errors  do  come  in  all  shapes 
and  sizes,  being  brought  about  by  various,  core 
or  less  obscure  causes;  classification  of  soft 
failures  is  frequently  difficult,  and  attempts  at 
generalization  often  lead,  to  confusion.  Our 
immediate  concern  is,  fortunately,  limited  to  the 
distinction  between  Pattern  Sensitivity  and 
Sequence  Sensitivity  as  it  applies  to  memories, 
and  memories  only. 

Since  an  ordered  set  of  M  bits  bas  a  range  of 


numbers,  the  storage  matrix  of  a  64-kbit  memory 
con  be  arranged  into  . 

17(6 Me)  -  265,536  -  1019'728  (5) 

distinct  patterns  of  ones  and  zeros.  Working  at 
a  rate  of  a  million  tests  per  second,  one  performs 

Y(365)  -  3.15*1013  (6) 


Figure  2.  Random  Number  Cencrator 


tests  per  year.  To  chock  all  patterns  which  enter 
into  equation  4,  one  would  work 
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Y(W)  -  3*10 


years 


(7) 


between  pattern  and  sequence  sensitivity  from  an 
altcrnaro  direction. 


19,724 


These  are  illusory  numbers  that  fall  to  enlighten 
the  querist.  Tec,  they  show  up  quite  frequently 
In  LSI  engineering,  since  rhelr  appearance  is  by 
tio  means  limited  to  large  memories.  To  check  all 
patterns  of  n  modest  250-bit  RAM  at  a  rate  of  a 
billion  tests  per  second,  one  would  need 
r.i 

Y<256)  -  3*10  years  (ft) 


end,  es  already  noted,  45  hits  still  take  a  full 
year.  The  concept  of  exhaustive  testing  has  no 
place  In  LSI  devices.  Exhaustive  testing  ends  at 
about  30  bits,  since  it  takes  33  minutes  to  check 
out  a  billion  patterns  at  a  rate  of  1  12Iz. 

pattern  Sensitivity  ve.--  is  Sccucnce  Sensitivity 

The  expression  "Pattern  Sensitivity"  refers  to 
failures  associated  with  individual  patterns  of 
data.  "Sequence  Sensitivity'*,  on  the  other  hand, 
refers  to  failures  brought  about  either  by  execu¬ 
tion  of  specific  sequences  of  operations  or  bv 
processing  of  specific  sequences  of  data  patterns. 
Speaking  of  one  particular  failure,  a  single  data 
word  Is  culled  out  for  pattern  sensitivity,  but  an 
entire  ordered  set  of  either  instructions  or  data 
words  Is  listed  in  the  case  of  sequence  sensitiv¬ 
ity.  Tor  example,  a  pattern  sensitivity  failure 
»f  ft  four  bit  memory  might  be  reported  as 


ft(9)  ~  ps1(1001)  (9) 

but  Sequence  Scnt.tlvity  must  necessarily  show  up 
US  b  fitting  of  Instructions  or  data  words: 

f  (?)  -  ss  (1001/0031/001 3 /OH  1/...)  (10) 

J  J 

Referring  to  sequences  of  "states'*  rather  than 
sequences  of  "operations"  or  "data  words",  it  is 
convenient  to  write  a  sequence  sensitivity  state¬ 
ment  as: 


*fc  -  ee(y‘k.o',Xk,-l/Xi:.-2/-/\.-n> 


(ID 


Where  X,  is  state  zero,  that  is,  the  state  which 
k,o 

Yevcals  the  existence  of  a  problem,  while  X^  Us 

the  first  preceding  state  and  X,  is  the  u-th 

m,-n 

preceding  state.  Equation  11  implies  that,  having 
bhscived  a  failure,  ve  backtrack  n  steps  towards 
the  source  of  trouble.  The  problem  is  supposed  to 
be  Cumulative  rather  than  discrete,  X^.  being 


the  furthermost  significant  state.  A  classical 
Sequence  sensitivity  failure  will  r.-rer,  if  the 
given  sequence  is  replayed,  beginning  at  state 
X.  .  Equation  11  represents  the  "trace"  of 

K 

Sequence  number  "k";  subscript  *'n"  g.ivcs  the 
length  of  the  trace  in  terms  of  machine  cycles, 
while 


L«*nY  (3  2) 

Vhfcte  T  *  period  of  a  machine  cycle 

Jives  the  length  of  the  trace  in  seconds.  It 
remains  to  estimate  r.  and  1.  but,  before  we  do  this, 
S.L  Uay  be  advisable  to  approach  the  distinction 


The  phrase  "a  failure  is  associated  with  state 
Xo"  is  somewhat  Incomplete,  since  a  failure  may  be 
associated  with  attempts  to  either  enter,  or 
leave,  or  maintain  a  given  state.  Only  the  last 
of  these  possibilities  can  be  described  adequately 
by  q  one— word  quotation  of  the  offending  pattern, 
and  only  this  possibility  constitutes  a  static  or, 
more  precisely,  a  quasi-static  situation.  Having 
written  some  data  into  the  memory,  we  simply  want 
to  keep  it  there  and  we  generally  succeed,  except 
for  the  peculiar  case  of  one  specific  data  word 
which  tends  to  degenerate  because  of  leakage  or 
"refresh"  noise.  This  is  pattern  sensitivity. 
However,  in  contrast  to  the  just  discussed  static 
case,  the  other  two  failures  are  dynamic  in  na¬ 
ture.  Ve  may  be  unable  to  either  change  v  . 

A.  t"i 

into  X."  or  X.  into  X,  ...  For  example,  a 
i,o  1,0  1 ,  +1 

one-by-four  RAM  may  manifest  a  failure  when  an 
attempt  is  made  to  go 

from  X.  ,  “  1000 

to  X.  *»  1001 

i,o 

though  the  transition 


from  X.  - 

-  1011 

(13c) 

tO  Xi  „ 

1,0 

-  1001 

(13d) 

Is  invariably 

sound . 

Here  wc  deal  with  a  case  of 

sequence  sensitivity  that  is  particularly  convinc¬ 
ing  if  the  said  failure  is  predicated  on  the 


specific  values 

z  »  0001  (14e) 

X±  _3  -  0000  (14 f) 


’  and  X.  -  3010  (14u) 

i,-u 

recorded  as 

ssi(9)  -  1001/1011/0001 /0000/.../1010  (15a) 

or  csjL(9)  -  0/11/1/0/.  ../10  (15b) 


The  distinction  between  pattern  sensitivity  and 
sequence  sensitivity  may  be  minimal.  It  is  theo¬ 
retically  possible,  though  practically  unlikely, 

that  attempts  to  write  X.  »  9  will  fail  irre- 

1,0 

spective.lv  of  the  choice  of  X.  .  ,  X,  ,,  etc. 

l,“l  1*—^ 

One  can  classify  such  cases  as  pattern  sensitivity 
because  a  single  data  word,  i.o., 

(16) 

does  describe  the  failure  adequately.  However, 
one  can  also  classify  them  as  degenerate  sequence 
sensitivity  because  transitions  are  at  issue, 
though  n  is  equal  to  one  in  the  statement 


(13a) 

(13b) 


P.4S 


ft  -  8Sl<-/9)  (17) 

Regular  sequence  sensitivity  is,  of  course,  char¬ 

acterized  by 

n  >  1  (18a) 

Ct  even  b  »  1  (18b) 

The  number  n  may  vary  widely.  We  do  nor  concern 
Ourselves  with  its  exact  value,  hut  we  ciust  csti- 
toatc  the  upper  bound  of  this  important  number. 

The  length  of  the  Trace  of  a  Failure 

The  present  state  of  a  discrete-tine  system  can  be 
written  in  conventional  notation  (9)  as 

X(0)  -  BU(-T)  +  CX(-T)e-A(T/T)  (19a) 

»  BU(-T)  +  Er  (-2T)c'A(T'/T)  +  C2B(-3T)R"A(2f^T>  + 

+  CnX(-nT)e"A(t:‘T/T)  '  (19b) 

X(0)  evidently  depends  first  on  the  cost  recent 
Stimulus,  then  on  the  situation  at  t  '  -T,  and  only 
eventually  on  events  at  t  =  -nT.  Recollection  of 
pest  events  fades  away  quite,  rapidly,  an  long  as 
the  circuitry  in  in  a  conducting  state.  The  rela¬ 
tive  importance  of  any  particular  state  is  attenu¬ 
ated  by  SO  dbs  in  a  tine  interval 

I.  *•  nT/t  »  10*  (Access  Tice)  (20) 

A 

A  complication  arises,  however,  because  storage 
capacitors  of  "single-transistor"  RAMS  are  main¬ 
tained  in  a  non-conducting  state,  most  of  the  time. 
Time  constants  of  cut-off  circuitry  cay  be  as  long 
«s  30  minuces  or  even  more  (17).  Nonetheless , 
this  is  only  superficially  disconcerting.  The 
cut-off  state  of  dynamic  storage  elements  is 
periodically  interrupted  by  the  "refresh"  mecha¬ 
nism.  We  arc  free,  therefore,  to  think  of  a 
fctquonce  of  refresh  operations  and  to  assert  that 
the  effect  of  any  single  event  wears  oif  in  time 
1^,  Ctjul valent  to  10  refresh  periods: 

ly  -  10*Tr  .  •  (21) 

Host  manufacturers  of  semiconductor  memories  stim¬ 
ulate  a  refresh  time  of  2  milliseconds,  and  most 
users  abide  by  this  requirement.  We  can  assume, 
consequently ,  that  the  trace  of  a  failure,  dees  not 
exceed  20  milliseconds,  even  where  storage  capac¬ 
itors  enter  into  the  act. 

Procedures 

£ach  PUT  is  initialized  with  its  own  random  data; 
addresses  are  called  out  sequentially  by  the  Up¬ 
town  Counter,  but  the  "data  bit"  is  supplied  by 
Che  Random  Number  Generator.  The  binary  states 
Of  the  data  bit  arc  interpreted  as  "change"  and 
"no  change"  rather  than  as  "one"  and  "zero".  All 
■Initialization  data  is  fed  simultaneously  to  the 
J)UT,  the  Reference  Memory,  and  the  appiopriate 
Sector  oi  the  Main  Memory.  A  read  rpcrarlon 
follows  in  the  wake  of  each  write  operation  begin¬ 
ning  with  address  1  and  ending  with  address  M. 
llclcct  Ion  of  a  failure  diverts  the  tester  into  a 
note  complex  Jnlt JalizntJ on  procedure,  without 
prejudice  to  the  already  acquired  failure 


information.  One  way  or  another,  by  the  time 
address  M  is  reached,  the  state  of  the  DUT  Is 
duplicated  In  the  Main  Memory  as  well  as  In  the 
Reference  Memory;  this  state  of  the  DUT  is  denoted 
by  Xj  for  linguistic  convenience. 

In  the  case  of  the  first  DUT,  random  testing  be¬ 
gins  immediately  after  initialization.  Test  vec¬ 
tors'  furnished  by  the  Random  Number  Ccnerator  are 
fed  simultaneously  to  the  DUT,  Che  Reference 
Memory  and  sector  1  of  the  Main  Memory.  The  Pat¬ 
tern  Block  of  sector  1  follows  the  DUT,  while  the 
Vector  Sector  stores  the  individual  test  vectors 
in  FIFO  fashion.  When  a  failure  is  detected,  the 
information  in  the  Vector  Sector  is  used  to  re¬ 
store  the  system  to  either  state  X.  or  X  ,  which- 

X  -n 

ever  may  apply.  The  trace  of  the  failure  is  then 
rerun  through  the  DUT  and  the  Reference  Memory  for 
verification  purposes.  We  need  to  know  whether 
the  observed  failure  can  be  reproduced  at  will. 

The  Pattern  Sector  does  not  participate  in  the 
second  phase  of  the  verification  exercise;  it  just 
retains  (.he  "initial  pattern"  Information  —  ei¬ 
ther  X,.  or  X  —  for  use  on  other  DUTS. 

I  -n 

When  DUT  02  is  tested,  it  is  initialized,  tested 
for  susceptibility  to  the  trace  which  failed 
DUT  Jl,  and  only  then  does  random  tasting  begin. 

It.  continues  till  DUT  S2  falls  and  the  results  of 
the  failure  are  duly  recorded  in  sector  number' 
two  of  the  Main  Memory  and  location  numuer  two  of 
the  Results  Memory.  DUT  03  is  initialized, 
checked  out  for  susceptibility  to  traces  S  1&2, 
and  then  exercised  with  random  vectors.  DUT  ffW 
is  tested  for  susceptibility  to  W-l  previously 
generated  traens  before  it  gets  a  chance  to 
generate  its  own  trace.  Results  of  all  tests  are 
recorded  in  the  Results  Memory. 

Cross-References 

Cross-references  help  with  the  diagnostics  of  a 
given  family  of  devices.  As  far  as  traces  gener¬ 
ated  by  other  devices  are  concerned,  a  D'JT  may 
pans  a  test,  fail  during  loading,  fail  cn  comple¬ 
tion  of  all  n  steps  6f  a  trace,  or  it  may  fail 
early.  It  may  also  be  either  able  or  unable  to 
reproduce  the  failure  spelled  out  by  its  own 
trace.  We  want  to  know  how  many  devices  displayed 
unconfirmed  failures,  how  many  ialled  early,  and 
so  on. 

Purely  random  failures  are  not  reproducible.  Nu¬ 
clear  radiation  problems,  if  any  exist,  should 
generate  a  fair  number  of  "unconfirmed"  errors. 
Needless  to  say,  however,  there  exist  many  causes 
of  random  errors.  Convincing  evidence  of  random¬ 
ness  is  a  necessary,  though  highly  insufficient, 
prerequisite  of  arguments  uhich  seek  to  attribute 
device  failures  to  background  radiation. 

Fabrication-process  flaws  come  in  many  guises. 
Pinhole-size  flaws  vary  in  position  from  chip  to 
chip  and  from  wafer  to  wafer.  Patch-size  flaws, 
on  the  other  hand,  Lend  to  show  up  in  the  same 
section  of  the  wafer;  a  significant  fraction  of 
all  ch'ps  may  display  the  same  type  of  flaw  and 
the  same  typo  of  failure.  In  any  case,  be  they 
pinhole  or  patch  size,  processing  flaws  arc 
generally  reproducible;  they  do  not  generate 
'Vnconf luted"  failure'). 
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Circuit  design  flaws  are  highly  reproducible.  They 
are  certainly  permanent,  and  they  are  common  to 
all  chips,  though  they  cay  show  up  only  in  re¬ 
sponse  to  one  particular  sequence.  Yield  Is 
likely  to  be  low.  If  one  particular  trace  dis¬ 
qualifies  cany  chips,  the  possibility  of  a  circuit 
design  flaw  should  be  investigated,  before  time  is 
Spent  on  other  explanations. 

All  in  o’?  then,  random  phenomena  bring  about 
unconfirmed  faults,  and  processing  flaws  gener¬ 
ate  fixed  failures  in  a  finite  fraction  of  a*ll 
chips,  while  circuit  design  delinquencies 
generally  result  in  low  yields. 

Conclusion 

A  self-learning  tester  is  feasible:  A  CACHE 
arrangement  with  a  disc  and  a  fail-sized  semi¬ 
conductor  menory  will  service  both  the.  Main  Memory 
end  the  Result  Memory.  All  other  hardware  require¬ 
ments  are  quite  modest. 

A.part  from  generating  useful  sequences  of  test 
vectors,  a  self-learning  tester  will  accumulate 
statistics  for  diagnostic  endeavors.  It  will 
separate  random  perturbations  from  fixed  flaws, 
and  will  supply  information  for  yield  analysis. 

With  luck,  it  may  ever,  uncover  complications 
overlooked  by  the  designer. 

It  has,  of  course,  its  limitations.  Evaluation  of 
151  devices  calls  for  thorough  analysis,  explor¬ 
atory  experimentation,  hard-failure  testing,  and 
random-testing.  Self-learning  testers  are  only 
for  random  testing  and  certain  aspects  of  explor¬ 
atory  testing. 
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Classification  of  Errors 

The  JC-42-78-65  publication  of  the  Join  Electron  Device 
Engineering  Council  (JEDEC)  classifies  errors  into  three  cate¬ 
gories,  namely  hard,  medium  and  soft.  Hard  Errors  are  manifested 
by  cells  which  "will  not  properly  store  data  under  any  condition 
of  test  operation".  Medium  Errors  are  said  to  exist  when  "cells 
will  properly  store  data  under  some  conditions  of  operation 
(voltage,  temperature,  timing  and  pattern),  but  will  make  errors 
under  other  conditions  of  operation".  Soft  Errors  correspond  to 
"an  occasional  random,  loss  of  data.  The  reason  for  the  error  is 
normally  not  identifiable,  nor  is  it  repeatable  on  a  device  tester". 

Our  own  preference  favors  a  straight  binary  division  into 
hard  and  soft  errors  only.  "Hard"  errors  are  attributable  to 
"hardware"  defects  such  as  shorts,  missing  contacts  and  floating 
gates.  "STUCK-AT"  faults  are  classical  examples  of  hard  errors; 
if  a  node  is  "STUCK-AT-ZERO"  it  will  be  low  regardless  of  input 
data.  In  that  sense  one  can  say  that  the  manifestation  of  hard 
errors  is  data  independent.  Soft  errors  on  the  other  hand  appear 
only  in  the  wake  of  certain  patterns  and  sequences  of  data  or 
addresses.  They  are  attributable  to  design  and  processing  flaws 
as  much  as  to  environmental  perturbations. 

In  brief,  we  distinguish  merely  between  data-independent  and 
data-dependent  errors.  Limited  in  scope  as  this  classification 
may  be,  we  need  it  to  articulate  a  comprehensive  strategy  of 
testing.  The.  word  data,  as  used  here,  refers  both  to  addresses 
and  to  input  information. 


Classification  of  Soft  Errors 

No  classification  is  more  important  than  the  division  into 
random  and  non- random  errors.  The  latter  can  be  reproduced  on 
command  but  the  former  cannot  1  This  is  indeed  how  we  can  and 
should  tell  them  apart.  VJhcn  an  error  is  observed  one  ought  to 
jump  back  sufficiently  far  in  the  test  program  to  guarantee 
re-emergence  of  reproducible  errors.  Observations  of  failures 
must  be  verified  and  diagnosed  if  the  testing  of  VLSI  devices  is 
to  become  effective.  Test  programs  should  include  appropriate 
retrace  loops. 

Nuclear  radiation  perturbations  are  among  the  most  publicized 
causes  of  random  errors.  Faced  with  them,  one  ought  to  determine 
their  rate  of  occurrence.  Even  more  important  is  the  question  of 
single  or  burst  errors.  Single  errors  can  be  corrected  quite 
easily,  but  bursts  cannot.  Cosmic  rays  are  potentially  more 
dangerous  than  alpha  particle.  Polyimide  coating  of  the  chip  is 
used  nowadays  to  guard  against  radioactive  materials  imbedded  in 
the  package.  The  range  of  alpha  particles  is  limited  to  100  microns. 
Cosmic  rays,  on  the  other  hand,  cannot  be  stopped,  not  unless  one 
is  willing  to  use  ten  centimeters  of  lead. 

Reproducible  soft  errors  are  mostly  a  matter  of  design  or 
fabrication  flaws.  The  circuitry  of  modern  memories  is  complex; 
there  may  be  performance  marginalities  in  the  address  decoders, 
or  the  output  section  or,  as  is  frequently  the  case,  the  bit-line 
circuitry.  Figure  1  illustrates  the  interdependence  of  the  many 
elements  associated  with  the  sense  amplifiers  of  dynamic  RAMs. 
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Commercial  Test  Patterns 

Various  test  programs  were  developed  by  vendors  and  users  of 
semiconductor  memories.  Manufacturers  of  test  equipment  contri¬ 
buted  their  share  too.  More  or  less  accurate  claims  regarding 
the  effectiveness  of  individual  programs  are  based  on  empyrical 
data  rather  than  mathematical  arguments.  They  should  be  treat,  d 
with  caution.  All  reasonably  good  programs  take  care  of  hard 
errors;  the  justification  of  extra  cost  comes  from  soft  errors. 
These  differ,  however,  from  one  product  to  the  next  and  one  ven¬ 
dor  to  another.  Consequently,  there  are  no  universally  good 
programs.  To  be  cost  effective,  a  program  must  be  personalized 
with  respect  to  a  particular  product  and  vendor  with  attention  to 
modifications  continually  introduced  by  memory  manufacturers  in 
the  interest  of  yield. 

It  takes,  of  course,  time  to  analyze  a  new  product  and  to 

develop  an  appropriate  program.  One  cannot  stop  the  world  while 

waiting  for  .the  ideal  test  to  emerge.  That  is  where  commercial 

programs  play  a  useful  role  in  spite  of  their  limitations.  We 

give  here  three  examples,  not  necessarily  the  best.  The  total 

1  3/2  2 

number  of  operations  may  be  proportional  to  N  or  N  '  or  N  . 

Our  purpose  is  to  demonstrate  the  differences  between  these  three 
categories  of  tests.  Our  programs  are  written  in  self-explanatory 
pseudo-fortran  language. 

We  begin  with  the  well  known  MARCH  program. 


March 


Reset  the  background  to  ZERO 

For  i=0  to  N-l 

Read  ZERO 

Write  ONE 

Return 

For  j  =  N-l  to  0 
Read  ONE 
Write  ZERO 
Return 

Reset  the  background  to  ONE 

For  i=0  to  N-l 

Read  ONE 

Write  ZERO 

Return 

For  j  =  N-l  to  0 
Read  ZERO 
Write  ONE 
Return 


Total  number  of  cycles 


Galpat 

A.  Reset  background  to  ZERO 

Write  ONE  into  cell  0 

For  i=l  to  N-l 

Read  cell  0  • 

Read  cell  i 
Return 

Write  ZERO  into  cell  0 

For  i=l  to  N-2 

Write  ONE  into  cell  i 

For  j=0  to  i-1  and  j=i+l  to  N-l 

Read  cell  i  . 

Read  cell  j 
Return 

Write  ZERO  into  cell  i 
Return 

Write  ONE  into  cell  N-:l 

For  i-0  to  N-2 

Read  cell  N-l 

Read  cell  i 

Return 

Write  ZERO  into  cell  N-l 

Half  of  the  total  number  of  cycles 

B.  Reset  background  to  ONE 


#  of  operations 


2N (N-2 ) 


2N2+N 


Run  program  similar  to  part  A  with  appropriate  change  from 
ZERO  to  ONE  and  vice  versa 


Total  number  of  cycles 


2{2N  +N) 
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As  to  the  details  of  the  GALTDIA  program,  refer  to  figure  1  and 
proceed  as  follows: 

A.  Reset  the  field  to  ZERO  , 

20  For  i=l  to  2/ N-2 
Go  to  diagonal  i 
For  j~0  to  i 
Select  a  test  cell 
Read  the  previous  cell 
Read  the  test  cell 
Complement  the  test  cell 
Gallop  along  the  diagonal 
Complement  the  test  cell 
Return 
120  Return 

B.  Reset  the  field  to  ONE 
Repeat  lines  20  through  120 

To  proceed  with  the  computational  aspects  of  GALTDIA,  let  us 
go  through  the  following  observations: 


(No.  of  cells  in  diagonal  i)  =  i+1  (3) 
No.  of  gallop  operations  per  test  cell  =  2i  (4a) 
No.  of  conditioning  opertions  per  test  cell  =  4  (4b) 
Total  No.  of  operations  per  test  cell  -  2(i+2)  (4c) 
No.  of  operations  per  diagonal  =  2(i+l(i+2)  '  (5a) 

-  2i 2+6i+4  (5b) 
No.  of  operations  in  a  Reset  step  =  N  (6) 


Now  let  Q  be  the  total  number  of  cycles  in  a  GALTDIA  program. 
-  Noting  the  numerical  duplication  between  diagonals  1  through  /N~2 
on  one  hand  and  /N  through  /N-2  on  the  other,  one  can  write  Q  as 


/N-2  9 

Q  =  2{  l  2f2(i)  +6i]  +  0  (/N-2 )  +  2/N(/N-U)+n} 
i=l 


(7a) 
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Substitution  of  equations  1  and  2  into  7a  leads  immediately  to 


Q  =  2{~-(/N-2)  (/N— |)  (/N-l)+6(/N-2)  (/N-l ) +3N+10/N-16J  (7b) 


=  2(-j  N3/2  +  3N  +  ~  N1//2  -  C) 


(7c) 
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Testing  of  VLSI  Hardware 

Small  circuits  can  be  tested  exhaustively;  large  circuits 
cannot.  Soft  errors  are  responsible  for  the  dilemma.  Being  data 
dependent,  they  may  show-up  in  the  wake  of  but  a  few  combinations 
of  input  and  address  bits.  Manifestations  described  vividly  as 
pattern  and  sequency  sensitivity  are  indeed  quite  common.  Many 
problems  arise  also  from  inadequate  identification  of  worst-case 
conditions.  As  it  happens,  memories  pose  more  problems  than 
microprocessors . 

To  demonstrate  the  futility  of  massive  testing  consider 
simple  pattern  sensitivity  in  a  small  memory  and  let  your 
intuition  translate  the  results  into  large-memory  language. 

N  bit''  can  be  arranged  into 


C(N)  =  2 


N 


(1) 


different  combinations  such  as 


0 

0 

0 


0  0 
0  0 
1  0 


0  0 
0  0 
1  0 


0 

1 

1 


(2) 


to  examine  each,  one  would  have  to  perform  at  least 


T(N)  =  2(N+1* 


(3) 


tests . 


This  would  add  up  to 


T(  1024  }  «  21025 
-  10300 

tests,  in  the  case  of  a  lk  memory.  Working  at  the  rate  of  ten 
million  test  per  second,  one  would  need 

S(1024)  =  10293  seconds 

=  3  x  102®^  years 


to  complete  the  test. 

We  need  not  go  any  further.  An  intelligent  alternative  to 
exhaustive  testing  is  obviously  called  for. 


! 
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Test  Strategy 

Chip  design,  system  architecture,  testing  and  reliability  are 
inter-dependent.  The  testability  objective  imposes  -  v:hen  it 
comes  to  memories  -  two  conditions  on  system  architecture. 

1.  In  a  m-by-N  memory,  use  1-by-N  chips  in  preference  to 
m— by— ( N/m )  chips  {see  figure  2);  the  former  are  less 
likely  to  produce  multiple  errors. 

2.  Use  one  bit  error  correction.  This  is  not  half  as  expen¬ 
sive  as  it  appears  to  be.  The  correction  code  can  be 
applied  to  multiple  words  in  order  to  reduce  the  overhead 
burden  (see  reference  13). 

The  actual  test  is  organized  into  three  phases,  the  last  two 
of  which  can  be  combined  into  one. 

1.  parametric  testing 

2.  hard-error  testing  by  means  of  general  purpose  programs 

3.  soft-error  testing  by  means  of  personalized  programs 

Parametric  testing,  that  is  the  measurement  of  supply  currents, 

* 

input  capacitances  etc.  identifies  bad  units  very  efficiently. 

It  can  also  be  used  to  monitor  the  consistency  of  the  fabrication 
process  and  thus  issue  warnings  of  soft-error  hazards. 

The  hard-error  test  is  supposed  to  ensure  that  1)  each  cell 
will  properly  accept,  store  and  return  a  HIGH  or  a  LOW  and  2)  the 
decoding  circuitry  actually  yields  N  distinct  addresses.  The 
pi  :  •  i  1  i  v c 


p  ngram  of  figure  3  will  accomplish  both  these  tasks. 

•  programs  will  do  that  also  but  by  more  ambitious 
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Programs  for  the  detection  of  soft  errors  should  come  from 
intensive  theoretical  and  experimental  study.  Every  product  and 
every  modification  of  every  vendor  must  be  investigated.  Mature 
products  display  fewer  -  and  possibly  different  -  errors  than 
their  predecessors.  Once  the  causes  of  marginal  performance  are 
known,  suitable  programs  can  be  written  and  combined  with  the'- 
hard-error  complements.  Commercial  (and  therefore  general 
purpose)  programs  attempt  to  do  just  that,  but  their  cost  effec¬ 
tiveness  is  poor  indeed.  Only  tests  make  sense  when  it  comes 

o  3/2 

to  large  chips  (1GK  and  up);  N'  and  even  N  "  programs  are 
completely  inacceptable .  There  is  no  choice:  One  must  analyze 
the  design  and  probe  the  chip  till  all  significant  sources  of 
soft  errors  are  identified  and  eliminated.  To  be  feasible,  test 
programs  for  soft-errors  must  be  personalized.  Furthermore  it  is 
necessary  to  base  test  strategies  on  the  premise  that  in  devices 
which  have  passed  the  hard-error  test,  soft-errors  are  only  few 


a)  Good  b)  Bad 

Fig. 2:  Architecture  of  a  16-by-lGk  Memory 
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The  Cost  of  Error  Correction 

At  least  two  items  must  be  identified  when  error  correction 
is  contemplated:  1)  The  access-time  penalty  and  2)  the  cost  of 

l 

excess  hardware.  Dynamic  response  cannot  be  quantized  without 
reference  to  both  system  details  and  chip  characteristics;  a  lot 
depends  on  the  difference  between  access-time  and  cycle-time  - 
pipelining  can  be  used  to  shift  the  focus  from  the  former  to  the 
latter.  The  excess  hardware,  on  the  other  hand,  can  be  projected 
in  general  terms. 

To  assess  the  cost  or  error-correcting-hardware ,  let  there  be 
m  data  bits  and  k  correction  bits  for  a  total  of  m+k.  There  may 
be  an  error  in  any  one  of  the  m+k  bits.  Required  is,  therefore, 
a  message  which  states: 

there  is  no  error 
or  there  is  an  error  in  bit  one 
or  there  is  an  error  in  bit  two 


or  there  is  an  error  in  bit  m+k 
v 

That  adds-up  to 

M  =  m+k+i  (1) 

messages  which  are  to  be  articulated  by  means  of  k  bits. 

There  is  a  repertiore  of 

K  =  2k  (2) 

messages  in  k  bits.  What  one  wants  is 


mSmmmm 
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K  >  M 


(3) 


and,  therefore 
2k  >  m+k+1 


(4) 


i.e . 

2k-k  >  m+1 


(t>) 


Table  I  confirms  what  was  to  be  expected:  the  cost  of  excess 
hardware  is  acceptable  if  one  operates  on  sufficiently  long  words. 
Consequently,  one  might  consider  error  correction  for  multiple 
rather  than  single  words  and  that  is  exactly  what  is  done  in  IBM 
system  370/158  (reference  13  of  Test  Strategies).  Application  to 
memories  of  block-oriented  error  correction  is  a  viable  research 
topic . 

Nota  bene  that  we  talk  about  off-chip  error  correction. 

On-chip  correction  should  not  be  attempted  without  prior  atten¬ 
tion  to  many 'outstand ing  physical  and  architectural  details. 

Table  i  :  Single  Bit  Error  Correction 


1 

2 

4 

8 

16 

32 

64 


k 

2 

3 

3 

4 

5 

6 
7 


%  efficiency 

33 

40 

57 

67 

76 

84 

90 


128 


8 


94 


Conclusion 

Less  is  better,  when  it  comes  to  testing.  Invest  in  research 
rather  than  test  equipment.  Massive  testing  is  expensive  yet 
unproductive.  Emphasis  must  oe  put  on  the  elimination  of 

f 

multiple  errors  rather  than  the  detection  of  every  individual 
error.  That  is  why  perusal  of  circuit  design  is  crucially  impor¬ 
tant.  Design  flaws  lead  to  the  inacccptable  multiple  errors. 
Testability  -  in  the  case  of  memories  -  means  1)  architecture 
per  figure  2  and  2)  single-error  correction. 

Hard  failures  can  be  caught  easily.  Soft  errors  should  be 
rare;  single-error  correction  will  eliminate  them.  A  continual 
diagnostic  effort  must  be  maintained  where  reliability  is  of  a 
premium. 

Parametric  testing  can  provide  clues  and  warnings.  The  con¬ 
sistency  of  the  manufacturing  process  must  be  monitored  by  the 
user  as  well  as  the  manufacturer.  In  VLSI  testing,  there  is  no 
substitute  for  know-how.  , - ,  _ _ 
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IDENTIFICATION  OF  CAUSES  OF  PATTERN  SENSITIVITY 


D.  D.  Rincrson  and  A.  Tuszynski 


Abstract 

Pattern  Sensitivity  at  the  device  level 
is  attributable  to  deficiencies  in  circuit 
design  and  processing;  at  the  system  level, 
user  misdeeds  must  be  additionally  taken  ihto 
account.  Meaningful  testing  for  Pattern 
Sensitivity  requires  diagnostic  expertise  in 
all  these  areas.  Though  difficult  to  detect 
by  hit  and  miss  techniques.  Pattern  Sensitiv¬ 
ity  is  deterministic  ir.  nature,  can  be 
tracked  down  by  sufficiently  sophisticated 
testing,  and  will  be  eventually  eliminated 
through  improvements  in  circuit  design  and 
process  control. 

Introduction 

Though  the  concept  of  Pattern  Sensitiv¬ 
ity  (PAS)  is  somewhat  elusive,  Schmoo  plots 
tell  the  story  eloquently.  Figures  1  and  2 
Show  the  results  of  tests  performed  on  a 
suspect  4k-RAM.  The  Schmoo  plot  for  "Column 
Disturb’  differs  significantly  from  that  for 
"Galpat".  Though  each  of  the  two  programs 
checks  the  operation  of  every  individual 
Storage  cell,  the  unit  under  test  passes  the 
Galpat  test  but  fails  the  Column  Disturb.  The 
difference  in  the  Schmoos  is  attributable 
solely  to  the  difference  in  the  data  patterns 
generated  by  the  respective  programs.  There 
is  nothing  accidental  in  these  results.  Once 
identified,  Pattern  Sensitivity  can  be  repro¬ 
duced,  on  command,  as  demonstrated  by  the  rerun 
tests  shown  in  figures  3  and  4.  Other  tests 
show  that  the  conslusions  regarding  the  rela¬ 
tive  efficiency  of  particular  programs  can  be 
extended  to  all  devices  of  the  same  type  and 
the  came  origin,  but  not  necessarily  to 
second-source  parts. 
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FIGURE  2  :  COLUMN  D5TURP  TAKEN  1/24/77  at  7:30  PM 

In  cpit.o  of  the  complexities  of  LSI, 
there  is  actually  no  mystery  in  PAS.  Evident¬ 
ly  enough,  the  Galpat  program  does  not  gener¬ 
ate  the  worst  case  pattern  of  the  given  type 
of  device.  The  Column  Disturb  test  is  more 
severe  and,  incidentally,  more  efficient  than 
Galpat  -  it  takoo  20  minutes  to  run  a  full 
TACC/\‘dd  Column  Disturb  Schmoo  but  a  full  hour 
■to  do  tho  Btmo  for  Galput  -  yet,  for  all  we 
know,  it  nay  not  ho  nevere  enough.  Only 
thor.c  programs  which  Actually  goncr/ste  worct 
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case  data  patterns  are  sufficiently  severe. 
Good  programs  generate  worst  case  patterns 
efficiently.  To  develop  such  programs,  one 
must  begin  with  an  adverse  analysis  of  the 
circuitry  and  then  follow  through  with 
process  diagnostics  and  exploratory  testing. 


The o rctical  Speculations 

Any  deterministic  system  can  bo  com¬ 
pletely  modeled  by  matrices  A  and  E  to  be 
used  as  operators  in  equations  which  relate 
the  state  variables  x  to  the  input  vector  u 

as  in 

x  **  A  x (t)  +  B  u (t)  (3 ) 


Unfortunately,  in  real-life  L.'it  prob¬ 
lems,  the  dimensions  of  matrices  A  and  li  are 
prohibitive  while  the  elements  a...,  b_,  are 
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non-linear  and  only  vaguely  predictable. 
Consequently,  one  seldom  employs  the  full 
state-space  presentation  in  actual  analysis 
of  LSI  circuits.  Nonetheless,  a  few  useful 
conclusions  can  be  dravm  from  a  free-wheel¬ 
ing  interpretation  of  these  equations. 

Microprocessors,  memories  and  most  other 
LSI  devices  fall  into  the  category  of  "con¬ 
tinuous-time  systems  with  discrete-time 
inputs".  .The  transition  equations  of  such 
systems •  relate  the  state  variable  x  t. 
the  input  u  through  the  matrix  operators  A 
and  B  as  in 


x(kT4T)  =  [eAT)x<kT)-( 


rkT+r 


’  kT 


(e 


A(kT+t-v) 


] bu (v)dv 
(2) 


where;  T  «*  clock  period 
and  (kT  +  x  •»  t.) 


Kith  this  ir.  mind.,  let  us  feed  new  data 
into  the  system  at  t  =  kT  and  let  u:,  observe 
the  state  of  the  system  at  t  =>  kT  +  aT, 
where  a  is  less  than  one,  but  greater  than 
kero.  Beginning  at  t  =  o  with  an  initial 
state  x(o),  vc  get 

/aT 

x  (aT)  “  leaATJ  x  (o)  4- J  leA(t~v)]Bu(v)dv  (3) 

at  the  time  of  the  first  observation  and 


x {x+aT)  =  [eaA* ]  x  (T)  + 


,T4-aT 


.  A(TtaT-v) ,  ,  . 

[e  ] Bu (v)dv 


«) 


at.  the  time  of  the  second. 

Evidently  then,  the  state  of  the  system  at 
t=aT  depends  on  the  initial  state  x(o)  as 
well  as  the  input  u(o),  while  the  state  at 
t-T+aT  depends  on  u(l),  u(o)  and  t(o). 

In  other  words,  the  output  at  t  -  T+aT 
depends  not  only  on  the  data  pattern  fed 
into  the  system  at  t+T  but  also  or.  the  state 
of  the  system  at  t-T,  and  that  depends  in 
turn  on  the  data  input  at  t-o  as  well  as  the 
initial  state  x(o).  If  we  now  read  the  last 
output  as  either  true  or  false  only,  then  wc 
may  conceivably  get  "correct"  or  "incorrect" 
outputs  in  response  to  a  fixed  u(l)  but 
vat itd  u(o)  or  x(o).  This  constitutes 
cither  pattern  or  sequence  sensitivity  or 
both,  depending  on  the  interpretation  of 
pertinent  events. 


1C7 


The  extent  of  the  above  dependence  is 
directly  related  to  the  duration  of  the  clock 
period  T.  The  effect  of  history  decays  with 
time  end  consequently,  as  the  period  is 
gradually  extended,  x{T+aT)  becomes  virtually 
independent  of  \«  Co)  and  even  more  independent 
Of  x(o).  Furthermore,  its  value  approaches 
the  extrapolated  steady  state  level 

I  * 

limit  x(T-taT)  «  limit  [  [e7^*  v^]Bu(v)dv 

T-»<»  t-*~“  •*  T 

It  follows  tlien  that  one  should  never  encoun¬ 
ter  dynamic  Pattern  and  Address  Sequence 
Sensitivity  (PAASS)  at  relatively  low  speeds 
although  one  will  invariably  run  into  compli¬ 
cations  at  excessive  speeds.  To  avoid  dynamic 
PAASS  one  must  simply  determine  the  worst  case 
dynamic  response  in  terms  of  data  patterns , 
cell  locations,  clock  skews  or  what  have  you, 
and  then  spec  out  the  device  accordingly. 

Khat  to  the  eyes  of  the  accidental  observer 
appears  to  be  random  TAAoS,  is  actually  a 
deterministic  demonstration  of  excessive 
Epocd.  Vou  can't  run  on  8 MHz  unit  at  lOMliz 
and  blame  your  '■•tcblems  on  I'AASS. 

The  transmission  matrices  (A.  and  B)  of 
X£l  devices  are  less  sparse  than  those  of 
equivalent  discrete  circuit;;.  The  difference 
lien  of  course  in  skew  elements  that  model 
crosstalk.  Naturally  enough,  if  a  device 
works  at  all,  one  expects  that  significant, 
parasitic  coupling  is  limited  to  neighboring 
devices.  Thus,  in  an'  assembly  of  storage 
cells,  one  expects  more  crosstalk  between, 
cay, 

ftii  and  ai-l,  i+1 

V 

than  between 

&ii  nnd  ai-2,  i-t 2 

Particularly  important  are  of  coarse  cumula¬ 
tive  effects.  Consider  for  example  the  bit- 
line-sensing  arrangement  introduced  by  TI  a 
few  years  ago  and  adopted  since  then  by  many 
Other  manufacturers  of  dynamic  memories 
(figure  5).  The  voltage  difference  AV,  read 
by  the  sense-amplifier,  depends  on  1)  the 
Charge  in  the  addressed  cell,  2)  the  charge 
in  the  reference  cell  and  3)  the  nodal  volt¬ 
ages  of  the  stand-by  cells.  Tin;  last  of 
these  contributions  is  of  course  a  parasitic 
force  vhich  arises  from  conductive  and  non¬ 
linear  reactive  leakage.  If  the  cells  on  Che 


left  hold  many  more  zeros  than  those  on  the 
right  (or  vice  versa) ,  then  the  parasitica 
may  become  significant.  Particularly  objec¬ 
tionable  are  data  patterns  which  have  a  sin¬ 
gle  zero  on  one  side  but  a  full  field  of  ze¬ 
ros  on  the  other,  or  conversely,  a  single 
high  on  one  side  but  a  full  field  of  highs  on 
the  other. 


FIGURE  E:  SIMPLIFIED  SENSE  AMP. 

Finally,  one  must  realize  that  most 
performance  deficiencies  arise  from  oversight 
on  the  part  of  the  designer. 

The  first  commercial  static  memories 
suffered  from  inadequate  address  separation , 
early  "one  transistor"  HAM:;  had  insufficient 
hit-line  swing  and  a  RAM  of  later  vintage  was 
fouled  np  by  an  illegal  combination  of  trans¬ 
mission  gates6.  Interestingly  enough, 
many  such  problems  were  detected  by  competi¬ 
tors  a  long  time  before  they  were  corrected 
by  the  original  vendors.  Design  flaws  can  be 
detected  by  bystanders  since  eyeball  analysis 
is  frequently  more  effective  than  CAD.  One 
should  bear  that  in  mind  when  testing  LSI 
devices . 

Explore tory  Testing 

Although  theoretical  speculations  yield 
clues  rather  than  answers,  dues  can  be 
developed  into  answers  by  exploratory  test¬ 
ing,  In  the  case  of  the  16  pin  4k— RAM,  one • 
suspects.,  for  example,  susceptibility  to  VnD 
noise  and . RAS/CAS  delay  (tRWJ  jitter.  Ei¬ 
ther  of  these  ootild  lead  to  marginal  perfor¬ 
mance  and,  therefore,  pattern  sensitivity. 

It  is  evidently  advisable  to  perform  a  icw 
experiments. 

Susceptibility  to  V...  noise  was  invecti- 
gated  i.y  wears  of  the  equipment  shown  In 
figure  C.  Various  AC  signals  were  fed  into 
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I  the  r.cnse  port  of  the  regulator  while  “Col- 
*  uran  Disturb"  Sclimoo  runs  were  bein','  executed. 
Continuous  and  pulsed  sine  wave  excitation 
was  used  to  simulate  high  frequency  pertur¬ 
bation  of  V  D-  Somewhat  surprisingly,  the 
4k-RAMr.  withstood  this  challenge  very  well. 
Specifically,  all  devices  subjected  to 
these  tests  turned  out  to  be  immune  to 
50MHz  signals  of  2.4  volts  peak-to-poak 
amplitude.  Nonetheless,  an  entirely  differ¬ 
ent  story  emerged  from  low  frequency  tests, 
conducted  with  triangular  and  trapezoidal 
waves  of  various'  slew  rates.  Dependence  on 
synchronization  of  the  "noise"  to  the  system 
clock  was  expected  and  indeed  confirmed. 
Various  weaknesses  were  observed  on  devices 
procured  from  four  different  sources.  Most 
interesting  of  these  was  sensitivity  to 
write-read  supply-line  differentials.  A 
test  witfi  10. 3  volts  for  write  but  13.2  volts 
for  read  rejected  all  devices  of  one  manu¬ 
facturer  but  passed  every  one  of  the  devices 
supplied  by  the  othe.r  vendors.  The  units 
which  failed  the  test  -worked  quite- well  on 
smooth  margins  of  and  equally  well  when 

written  at  13.2  but  read  at  10.8  volts. 
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FIGUTiE  6  :  EQUIPMENT  SETUP 

The  above  mentioned  test  -  sometimes 
referred  to  as  the  "Voltage  Bump"  test  -  does 
j  detect  supply  line  sensitivity  very  effec¬ 
tively.  It  also  facilitates  identification 
of  weak  circuit  elements.  While  it  is  almost 
customary  to  point  to  the  sense  amplifier  as 
the  principal  victim  of  V[n)  perturbations,  we 

I  have  found,  for  example,  th.u.  the  internal 
timing  chain  is  equally  weak.  Many  devices 
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which  passed  the  Voltage  Bump  test  at  nominal 
tRL(.  failed  on  the  margins  of  that  parameter; 
they  worked  well  with  smooth  supply  lines  of 
10.8  and  13.2  volts  respectively,  but  not  with 
the  equivalent  Voltage  Bump.  It  is  evident, 
therefore,  that  the  Liming  and  noise  specifi¬ 
cations  of  memories  require  far  more  attention 
than  they  have  received  in  the  past. 

Fol low-up 

With  the  results  of  extensive  theoretical 
and  experimental  studies  in  hand,  it  is  rela¬ 
tively  easy. to  write  programs  which  surpass 
in  effic^.  icy  any  assortment  of  Marches, 
Galpats  and  Disturbs.  One  is  convinced  that 
PAS  suspects  can  be  and  will  be  caught  practi¬ 
cally  v;ithout  fail.  Nonetheless,  or.e  knows 
tlwit  field  failures  occur  far  too  often. 

Worse  still,  the  "rejects"  frequently  check 
out  as  good  devices  when  retcafed  in  the 
laboratory.  Evidently,  there  must  be  a  lost 
link  somewhere. 

Random  PAS  is  certainly  a  possibility. 
Accidental  coupling  between  physically  distant 
and  electrically  unrelated  storage  cells  can¬ 
not  be  tracked  down  by  deterministic  means. 

The  detection  statistics  of  such  faults  are 
absolutely  incrediile.  Fortunately,  th:  like¬ 
lihood  of  such  faults  is  very  small  in  the 
first  place,  certainly  much  smaller  than  the 
rate  of  field  failures.  We  must  look  some¬ 
where  else  for  an  explanation .  There  nay 
exist  some  overall  system  problems.  Worst 
case  combinations  td  noise,  signal  delays  and 
supply  perturbations  may  just  suffice  to  upset 
a  fev?  marginal  devices.  Poor  understanding  of 
latent  characteristics  of  LSI  devices  is  the 
main  cause  of  such  problems.  Current  spikes 
during  "refresh"  exceed  the  average  consump¬ 
tion  by  an  order  of  magnitude  ar.d  may  induce 
significant  noise  on  -supply  lines,  in  ground 
wires,  and  at  the  signal  ports.  The  refresh 
mechanism  of  raemorim  may  in  itself  be  suscep¬ 
tible  to  various  kinds  of  noise.  In  the  pres¬ 
ence  of  noise,  some  memories  work  in  vrit.c- 
delay-rcad  cycles,  hut  fail  in  vr ite-tci  resh- 
read  operations.  Especially  treacherous  , 
though,  is  the  lat.vit.  sensitivity  of  the  CAN/ 
HAN  delay.  In  so.:."  I  Gk-ltAMs ,  the  ran""  it 

lur.c  is  roth,'u'''  r -■•••’  fl°  K  !  1‘  ''  '  ' 

Voltage  r.ump  ti.’ii 
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rot  surprising  that  field  failures  do  occur.  Computer  Design,  Dec.  75,  pp.  81-88. 


Some  of  these  may  he  attributable  to  uniden¬ 
tified  PAS,  but  many  more  arise  from  incom¬ 
plete  information  on  the  mundane  characteris¬ 
tics  of  semiconductor  memories. 

Conclusion 

Flaws  in  circuit  design,  chip  layout, 
and  processing  cause  PAS  as  well  as  all  other 
faults.  Circuit  design  problems  lead  the 
pack,  but  they  can  be  identified  by  theoreti¬ 
cal  and  experimental  analysis.  Extensive 
exploratory  work  is  essential,  but  experimen¬ 
tal  results  must  be  treated  with  caution. 

They  portray  complex  phenomena  and  frequently 
apply  only  to  devices  of  common  origin.  For 
example,  the  Voltage  Bump  test  observed  to 
fail' a  group  of  devices  with  1Q.8  to  13.2 
volt  write  to  read  ramps  works  in  reverse  on 
memories  from  other  families.  > 

Data  sheets  occasionally  conceal  more 
than  they  reveal.  Specifications  are  mean¬ 
ingless  in  the  absence  of  noise  limits. 

Timing  tolerances  depend  on  supply  line 
noise.  The  refresh  mechanism  of  some  devices 
suffers  from  defects  other  than  plain  leak¬ 
age.  All  such  problems  lead  to  pattern  sen¬ 
sitivity,  but  there  is  nothing  mysterious 
about  them.  Pattern  Sensitivity  is  essen¬ 
tially  deterministic  in  nature;  it  can  be 
identified  and  will  be  eventually  eliminated 
at  the  design  level  by  better  circuit  and 
process  models. 
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Abstract 

Pseudo  Analog  Testing  circumvents  difficulties 
encountered  when  attempting  to  diagnose  large 
monolithic  circuits.  Rather  than  swamp  a  high- 
ir.pedance  network  with  the  input  capacitance  of  a 
test  probe,  one  relies  on  indirect  but  credible 
results.  To  estimate  the  behavior  of  a  critical 
node  3,  one  observes  the  stream  of  data  which 
flows  from  the  output  port,  while  exploratory  ana¬ 
log  perturbations  are  applied  to  a  robust  "upwind" 
node  A.  Performance  margins  and  ether  bits  of 
information  are  deduced  from  the  magnitudes  of 
those  perturbations  which  convert  true  ONES  into 
false  ZEROS  and  true  ZEROS  into  false  ONES. 

Background 

Examining  tne  benavicr  of  new  circuits,  one 
docs  not  stop  at  the  I/O  pads.  Internal  nodes  are 
examined  ir.  pursuit  of  analog  information  that 
either  validates  expectations  or  divulges  the 
causes  of  ralfunctior.s.  Unfortunately  though, 
this  is  easier  said  than  done.  Power  economy 
being  a  fundamental  prerequisite  of  large  Scale 
Integration  (LSI),  one  is  invariably  faced  vith 
high'- impedance  regimes,  especially  ir.  MOS-LSI. 
There  might  be  some  stand-by  current  in  the  output 
p>orts,  tut  no  such  waste  will  be  found  in  the 
internal  branches  of  well-designed  chips.  Pre- 
charge  or  CMOS  techniques  are  the  rule  of  the  day. 
q  typical  subset  of  cc.tpcr.cr.ts  often  resembles  a 
stand-alone  capacitor,  over  a  substantial  part  of 
the  operating  cycle.  The  magnitude  of  the  equiva¬ 
lent  capacitor  cuy  be  quite  small  at  that,  much 
less  ^han  e  picofarad,  tore  often  than  not.  The 
itonltorir.g  problem  is,  therefore,  severe.  Conven¬ 
tional  oscilloscope  probes  are  entirely  inade¬ 
quate;  they  will  liter-fly  swamp  the  device  under 
test,  altering  the  local  voltage/current  relation¬ 
ship  or,  worse  still,  upsetting  the  normal  node  of 
operation  of  the  entire  circuit.  Relief  can  be 
gotten  -at  the  cost  of  compromises-  in  sODe  but 
r.ct  all  instances.  More  exotic  measures  must  be 
brought  into  play  to  deal  with  the  general  case. 

At  least  three  distinct  techniques  can  be 
eustered  when  the  electrical  characteristics  of 

'  *Work  Tcading  to  this  article  vns  funded  by  the 
Office  of  Naval  Research  under  contract  No. 
OOG1N-7S-C-OTH  mnitored  by  Dr.  Joel  M.  Morris. 

••Daniel  Burbank  lo  new  with  the  Honeywell  Solid 
State  Circuits  Center  ir.  Minneapolis,  MN. 


high-impedance  networks  are  to  be  measured  at 
regular  clock  rates.  First-off,  there  is  the  well 
knovr.  Low-Ir.put-Capacita.nce  Voltage-Follower  Il-j| 
of  Figure  1.  Positive  feedback  is  employed  in 
this  instrument  to  offset  the  real  and  tangible 
capacitances  of  the  probe  and  fixture  with  an 
artificial  negative  capacitance 

C  *  -C,^(al*K2  -  l)  (1) 

n  la 

There  is  a  flaw  in  this  scheme.  Being  feedback 
dependent,  Cn  varies  with  frequency.  If  fl  and  f2 
be  the  cut-off  frequencies  of  the  Buffer  end 
Amplifier  respectively,  then 


and  therefrom  cores  a  sobering  conclusion;  the 
probe  looses  its  effectiveness  at  fl/10  cr  f2/10, 
whichever  is  smaller. 


K1  Low-lnsji  -Capacitance  Fast  Buffet 
K2  Fast  Amplifier 


Ci-Cfi*Cr-(2j-t)Cta«0 

Fig.  1  A  Low  Input-Capacitance  Voltage-Follcwer 

A  second,  entirely  different,  technique  relies 
on  bombardment  with  a  narrow  tear,  of  electrons  to 
provide  resolution  Jr.  space,  time  and  voltage  of 
one  micron,  one  nanosecond  and  20  millivolts. 
Uniortur.ately  though,  the  impressive  frequency 
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response  c f  th«  »o  ti'.itS  Siruusccpic  Scanning 
Electron-  fUcrcisic;*  --'.I  it  uttriMtatie  entirely 
to  sampling,  T  "■  i 1  r«.ana  its*,  only  periodic  vave- 
forr.r.  «rr  repre:  :ra,  oi.«— sh.'t  phem.cntRa  and  noise 
are  sup  pressed.  Furthermore  ,  cur  nas  to  put-up. 
wit r.  the  rsur.y  :  ;r.ve:.ier.ces  oi  voc-ar.  equipment 
and,  worse  stil.,  the  rtq-i.-cd  equipment  it  nct( 
available  cone,,  r. lolly ,  Just  yet  1 7 , e J  -  That  is 
wry  yet  ar.cthe-  technique  -fstudo  Analog  Testir.g- 
does  acquire  ttc  status  of  a  viable  option,  in 
spite  of  its  own  peculiar  deficiencies. 

liit  I'eeudo  Analog  Technique 

Fseudy  Analog  Testing  15*1  circumvents  the  need 
to  probe  sensitive  nodes.  The  regular  flew  of 
data  is  obstructed  ty  quasi-static  analog  signals, 
Injected  into  »  relatively  robust  part  of  the 
circuit,  ty  means  of  a  laser,  a  current  source  or 
a  voltage  gene  rater  |iC|.  The  cbaructeristics  of 
a  sensitive  or  inaccessible  element  are  deduced 
froc.  the  j-etterr,  cf  errors  brought  on  by  the  ana¬ 
log  perturbations,  ir.  Figure  2,  block  C  stsnds 
for  the  sensitive  or  inaccessible  node,  the 
characteristics  cf  which  are  to  be  determined; 
block  E  represents  a  "robust"  r.cigr.tor.  Data 
flows  fro;.!  A,  through  S  and  C,  to  0.  The  analog 
perturbations  injected  at  B  affect  C  and  C  only. 
The  performance  rang. ns  of  C  are  measure!  in  terms 
of  magnitudes  end  polarities  of  perturbations 
which  cause  errors. 

This  sounds  fur  fetched  and  it  is,  often 
enough.  Or.e  can  name,  however,  applications  in 
which  Tacudo  Analog  Testing  does  take  sense.  A 
particularly  convincing  example  is  furnished  by 
the  bit  line  diagnosis  of  dynamic  EAXS. 


Pertinent  to  our  discussion  of  bit-line  cir¬ 
cuitry  arc  six  sets  of  components: 

1.  Storage  capacitor  Cj  or.  the  left  ar.d  reference 
capacitor  Crr  on  the  right  of  the  Sense  Amplifier 

2.  Gating  transistors  end  Crr 

3.  Reset  transistor 

A.  Sense  Amplifier  l20|'  transistors  and  Qsr 

5.  Sense-Enable  transistors  and  ^.r 

6.  Auxiljtiy  transistors  and 

Sot  to  be  overlooked  are  the  parasitic  capacitan¬ 
ces  of  the  tit  line,  simulated  by  lumped  capaci¬ 
tors  C-4  and  C-r  ir:  the  Mock  dirgram  of  Figure  3. 
These  distributed  capacitances  have  a  crucial 
bearing  on  the  performance  margin  of  the  RAX . 

They,  should  be  reduced  to  zero,  but  practical 
limits  are  set  by  the  state  of  the  art  in  pro¬ 
cessing  technology. 

A  regular  READ  cycle  includes  the  following 
events : 

1.  The  top  plate  of  the  reference  capacitor  is 
grounded  and  the  Sense-Enable  transistors  are 
cut-off,  while  the  auxiliary  transistors  art  dri¬ 
ven  into  a  high-impedcr.ce  state. 

2.  Both  sections  of  the  bit  line  are  precharged 
to  almost  Vpp 

3.  The  gating  translators  are  enabled  and  the 
voltage  on  the  right  hand  side  of  the  Sense 
Amplifier  is  reduced  to 


Fig.  2  Analog  Perturbation  of  Digital  Signals 

BiBynostlcr.  of  Dynamic  FAV.S 

Verifying  the  quality  cf  large  chips,  one 
takes  a  two-pronged  approach:  1GCJ  testing  for 
hard  failures  on  one  hand,  but  sampling  and  selec¬ 
tive  pursuit  of  soft  errors  on  the  other.  Soft 
errors  come  ir.  rr-ny  varieties.  Though  ml  func¬ 
tions  induced  ty  alpha  particles  Ill-la]  have  held 
the  limelight  in  the  sever.ticr,  ether  phenomena 
-processing  flaws  end  design  mistakes  uc.org,  therv. 
Ccn  be  acre  disruptive  llA-lfl.  Ml  kinds-  of 
obscure  sertnda ry  effects  frustrate  '.l.e  d-signer 
of  dynamic  KAXS  IllT-ii I.  Fortunately  the  ugh. 
many  a  perplexing  problem  cun  be  un:  a.  veiled  it  the 
bit-line  level  (Figures  3  arid  A). 


V  * 
r 


_ni_ 


c  ♦  c 

pr  rr 


(VED 


£’W 


(1) 


while  that  on  the  left  either  remains  relatively 
constant  at 


VH>  -  VDD  -  iVbDl 

or  elce  degrrdes  to 


(?) 


Vjd.) 


C~f*~ C  vVI)D  *  *VDiU  * 

III  ,1 


(3) 


depending  nn  the  data  stored  in  C^. 
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Bit  lin*  '*1 


The  4Vr,3  teres  in  the  above  equations  account 
for  the  degradation  of  the  bit  line  voltage  by 
precharge  imperfections.  What  really  matters, 
however,  is  the  difference  between  the  left  and 
the  right  rather  than  the'  absolute  value  or  the 
bit-line  voltage.  We  write,  accordingly, 


Fig.  U  Bit-bine  Circuitry  a  la  KK-tlib 
luations  account  C,  C, 


4V  «  AVn 


-  4V, 

uMl 


to  geS  a  simple  symbol  for  the  prechargc  error. 

It  veuid  appear  that  at  least  two  otner  complica¬ 
tions  aiSt  yet  be  acknowledged:  The  offset 
voltage  of  the  amplifier  and  the  stagger  of  the 
bit-line  capacitances.  The  forcer  is  denoted  c 
vhlie  the  latter  is  vritten  as 


ou  -  u,  -  v  ui 

•  p  pi  pr 

One  eight  add  that,  contrary  to  the  design 
lr.tent,  the  parasitic  bit-line  capacitances  Cpi 
and  Cpr  arc  tush  larger  than  the  storage  cell 
capacitors.  In  spite  of  that,  there  is  surpri¬ 
singly  little  merit  to  equation  5;  its  practical 
consequences  arc  negligible.  It  is  therefore  per¬ 
missible  to  write: 


_ CJl _ fi 

CP1  +  CJ  S 


C  t  c  c 
pr  rr  p 

The  effective  differential  voltage  £V) 
appears  at  the  input  terminals  "f  the  Sense 
Amplifier  car.  now  be  written  as: 

6V(H)  =  V1(H)  -  Vf  +  t 

■  (Crr/Cp)VW  +  c  -  AV 

or  ffV(L)  *  V1(b)  -  Vr 

-  -I.(Cj  -  Crr)/Cp)VDD  .  c  -  4V 

Corparison  of  equation  9  with  10  thovs  that 
results  will  be  obtained  with 


Crr  "  Cj  -  Crr 


c  ,  ■  c  »  c 

pi  pr  P 


C  /C.  •  1/2 
rr'  j 
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Verification  of  this  ratio  is,  accordingly,  the 
first  objective  of  bit-line  diagnosis.  Naturally 
enough,  one  also  wants  to  determine  c  and  AV. 

To  implement  the  pseudo-enaiog  technique,  one 
attaches  the  differential  bit-line  supply-network 
(Figure  5)  to  nodes  Xs,  Xp  and  Xr  of  the  PAM  'under 
test  (Figure  L).  The  old  metal  filaments  are 
destroyed  by  a  current  surge  free  a  50  microfarad 
capacitor,  A  r.e w  degree  of  freedom  is  thus 
secured.  The  right  section  of  the  bit  line  can  be 
offset  relative  to  its  left-hano  counterpart  by  an 
arbitrary  voltage. 


Fig.  5  Different,  al  utt-l  i.e  Supply 

The  RAM  under  test  is  operated  at  normal 
speed,  during  a  typical  FAT  exercise.  Data  is 
written,  refreshed  and  read  through  the  regular 
I/O  ar.d  control  pajis  or  the  chip;  nothing  is 
allowed  to  obstruct  the  flow  cf  digital  data. 
Analog  modulation,  on  the  ether  hand.  Is  injected 
through  three  ticroprcbes;  the  amplitude  and 
polarity  of  this  perturbation  is  adjusted  -in 
pseudo-static  style-  till  true  ONES  are  converted 
i  in  false  ZEROS  or  true  2EP0S  are  transferred  into 
*-falso  ON -5.  The  ovtrali  performance  margins 
become  immediately  apparent;  resolution  into  indi¬ 
vidual  error-cctpcnents  is  accomplished  through 
Judicious  variation  of  addresses  and  date  pat¬ 
terns,  fallowed  by  selective  blockage  of  gating 
transistors.  Sequential  hit-failure-mapping,  a  la 
Figure  C,  yields  beth  the  absolute  value  and  the 
spread  of  the  Cj/Crr  retio. 

Some  pseudo-ar.alog  testa  can  be  performed 
without  recourse  to  drastic  measures.  The  leakage 
characteristics  of  storage  cells  -to  name  one  pro¬ 
minent  parameter-  can  be  studied  through  the 
mechanism  of  the  extra-lcr.g  refresh  cycle.  One 
simply  finds  out  how  long  it  actually  takes  -at 
any  particular  temperature-  to  discharge  a  storage 
c-pacltor.  Again,  one  cetr  surprisingly 
interesting  absolute  and  relative  values. 

The  significance  of  relative  value*  is  not  to 
be  overlooked.  When  it  cones  to  aofv  errora. 


relative  values  ere  mort  important  than  their 
absolute  counterparts.  Mi 1 d  deviations  from  the 
norm  are  particularly  ominous.  They  should  be 
examined  ir.  detail.  Physical  locations  ar.d 
electrical  links  exist  he  noted.  A  top  logical  or 
functional  pattern  may  emerge  ar.d  thus  reveal  a 
source  o'  soft  errors. 

Conclusion 

Anaicg  insti  umentation  for  Vb£!  diagnostics  is 
still  in  its  infancy.  As  yet,  there  are  no  uni¬ 
versally  satisfactory  techniques;  cr.e-  fits  tr.f 
method  to  the  Job,  settling  f cr  all  lines  cf  limi¬ 
tations  and  inaccurac  les.  l’seucc  Ana  lor.  Test::.t 
is  slow  ard  destructive,  but  it  yields  results 
where  everything  else-  fails. 
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